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METHODS OF DIAGNOSIS OF BLADDER CANCER, COMPOSITIONS AND 
METHODS OF SCREENING FOR MODULATORS OF BLADDER CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 
5 This application is related to USSN 60/302,814, filed July 3, 2001; USSN 60/310,099, 

filed August 3, 2001; USSN 60/343,705, filed November 8, 2001; USSN 60/350,666, filed 
November 13, 2001; and USSN 60/372,246, filed April 12, 2001, each of which is 
incorporated herein by reference. 

1 0 FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein expression 
profiles and nucleic acids, products, and antibodies thereto that are involved in bladder 
cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis, and therapy of bladder cancer. The invention further relates to methods for 
1 5 identifying and using agents and/or targets that inhibit bladder cancer. 

BACKGROUND OF THE INVENTION 
In the United States, over 50,000 new cases of bladder cancer are diagnosed annually, 
and more than 10,000 deaths will be attributed to bladder cancer. Bladder cancer is now the 
20 fourth most common cancer among American men and the ninth most common cancer 

among American women. It occurs three times more frequently in men than in women, and it 
occurs roughly twice more firequently in white versus black men. 

Bladder cancer rarely occurs in people younger than 40 years of age, being primarily 
a disease of older men. Nonetheless, bladder cancer is a significant cause of illness and death 
25 in the United States. The risk of bladder cancer increases steeply with age, with over half of 
all bladder cancer deaths occurring after age 70. In white men older than 65, the annual 
disease rate of bladder cancer is approximately 2 cases per 1,000 persons; this contrasts with 
a rate of 0.1 cases per 1,000 persons yoimger than 65. 
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Within the United States, bladder cancer rates are higher among people who reside in 
northern versus southern states, and is higher for people who live in urban versus rural areas. 
Although this difference suggests that environmental as well as genetic factors may 
contribute to the development and progression of the disease, other studies confirm that 

5 certain genes play a role in bladder cancer.. For example, expression of the tumor suppressor 
gene p53 has been associated witli an adverse prognosis for patients with invasive bladder 
cancer. A retrospective study of 243 patients treated by radical cystectomy found that the 
presence of nucleai- p53 was an independent predictor for recurrence among patients with mid 
to late stage tumors. Esrig, et al (1994) N.E.J. Med. 331:1259-64. 

1 0 Urinary bladder cancers represent a spectrum of diseases that can be grouped into 

three general categories: superficial, invasive, and metastatic. The prognosis for treatment is 
highly dependent on the stage at which the tumor is first diagnosed. A unique aspect of 
bladder cancer treatment is that repeated surgical biopsy is an integral part of routine patient 
management. This has permitted the conduct of molecular genetic studies of tumors firom 

1 5 specific stages of the disease. The results of these studies suggest that bladder cancers 

develop and progress along at least two discrete pathways, which may account for differences 
in invasiveness and metastatic' potential. Incorporating molecular genetic factors into the 
current paradigm for diagnosis and treatment wiU optimize the probability of cure and allow 
the quality of Hfe for bladder cancer patients to be maintained. 

20 Early detection and treatment can prevent reoccurrence and progression of the disease 

to an incurable stage. Thus, the identification of novel diagnostic markers and therapeutic 
targets will improve the current treatment of bladder cancer patients. While industry and 
academia have identified novel sequences, there has not been an equal effort exerted to 
identify the flinction of these novel sequences in disease states. The elucidation of a role for 

25 novel proteins and compounds in disease states for identification of diagnostic markers and 
therapeutic targets is essential for improving the current treatment of bladder cancer patients. 
Accordingly, provided herein are methods that can be used in diagnosis and prognosis of 
bladder cancer. Additionally, provided herein are molecular targets for tlierapeutic 
intervention in bladder cancer and other related bladder diseases.. Further provided are 
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methods that can be used to screen candidate bioactive agents for the ability to modulate 
bladder cancer. 

SUMMARY OF THE INVENTION 
5 The present invention therefore provides nucleotide sequences of genes that are up- 

and down-regulated in bladder cancer cells. Such genes are useful for diagnostic purposes, 
and also as targets for screening for therapeutic compounds that modulate bladder cancer, 
such as hormones or antibodies. Other aspects of the invention will become apparent to the 
skilled artisan by the following description of the invention. 
1 0 In one aspect, the present invention provides a method of detecting a bladder cancer- 

associated transcript in a ceU from a patient, tiie method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown m Tables lA-13. 

In one embodiment, the present invention provides a method of determining the level 
15 of a bladder cancer associated transcript in a cell from a patient, 

In one embodiment, the present invention provides a method of detecting a bladder 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables lA-13, 
20 In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 

95% identical to a sequence as shown in Tables lA-13. 

hi one embodiment, the biological sample is a tissue sample. In another embodiment, 
the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. 
25 In one embodiment, the polynucleotide is immobiUzed on a sohd surface. 

In one embodiment, the patient is imdergoing a therapeutic regimen to treat bladder 
cancer. In another embodiment, the patient is suspected of having metastatic bladder cancer. 
In one embodiment, the patient is a human. 

In one embodiment, the bladder cancer associated transcript is mRNA. 
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In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

In another aspect, the present invention provides a method of monitoring the efficacy 
of a therapeutic treatment of bladder cancer, the method comprising the steps of: (i) providing 
a biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a bladder cancer- associated transcript in the biological sample by contacting the 
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80% 
identical to a sequence as shown in Tables 1 A-13, thereby monitoring the efficacy of the 
therapy. In a flirther embodiment, the patient has metastatic bladder cancer. In a further 
embodiment, ttie patient has a drug resistant form of bladder cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the bladder cancer-associated transcript to a level of the bladder cancer-associated 
transcript in a biological sample firom the patient prior to, or earlier in, the therapeutic 
treatment. 

Additionally, provided herein is a method of evaluating the effect of a candidate 
bladder cancer drug comprising administering the drug to a patient and removing a cell 
sample firom the patient. The expression profile of the cell is then determined. This method 
may further comprise comparing the expression profile to an expression profile of a healthy 
individual. In a preferred embodiment, said expression profile includes a gene of Tables lA- 
13. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables lA-13. 

In one embodiment, an expression vector or cell comprises the isolated nucleic acid. 

In one aspect, the present invention provides an isolated polypeptide which is encoded 
by a nucleic acid molecule having polynucleotide sequence as shown in Tables lA-13. 

In another aspect, the present invention provides an antibody that specifically binds to 
an isolated polj'peptide which is encoded by a nucleic acid molecule having polynucleotide 
sequence as shown in Tables lA-13. 

In one embodiment, the antibody is conjugated to an effector component, e.g., a 
fluorescent label, a radioisotope or a cytotoxic chemical. 
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In one embodiment, the antibody is an antibody fragment. In another embodiment, 
tlie antibody is humanized. 

In one aspect, the present invention provides a method of detecting a bladder cancer 
cell in a biological sample from a patient, the method comprising contacting the biological 
5 sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting antibodies 
specific to bladder cancer in a patient, the method comprising contacting a biological sample 
from the patient with a polypeptide encoded by a nucleic acid comprising a sequence from 
Tables lA-13. 

1 0 In another aspect, the present invention provides a method for identifying a compound 

that modulates a bladder cancer-associated polypeptide, the method comprising the steps of: 
(i) contacting the compound with a bladder cancer-associated polypeptide, the polypeptide 
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical 
to a sequence as shown in Tables 1 A-13; and (ii) determining the functional effect of the 

1 5 compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host ceU or cell 
membrane. In another embodiment, the polypeptide is recombinant. 
20 In one embodiment, the functional effect is determined by measuring ligand binding 

to the polypeptide. 

In another aspect, the present invention provides a method of inliibiting proliferation 
of a bladder cancer-associated cell to treat bladder cancer in a patient, the method comprising 
the step of administering to the subject a therapeutically effective amount of a compound 
25 identified as described herein. 

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drug screening assay comprising 
the steps of: (i) administering a test compound to a mammal having bladder cancer or to a 
cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
30 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
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as shown in Tables 1A-I3in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
the level of expression of the polynucleotide is a candidate for the treatment of bladder 
cancer. 

5 In one embodiment, the control is a mammal with bladder cancer or a cell sample 

therefrom that has not been treated with the test compound. In another embodiment, the 
control is a normal cell or mammal. 

In one embodiment, the test compound is administered in varying amounts or 
concentrations. In another embodiment, the test compound is administered for varying time 
10 periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables lA-13 are 
individually compared to their respective levels in a control cell sample or mammal. In a 
1 5 preferred embodiment the plurality of polynucleotides is from three to ten. 

In another aspect, the present invention provides a method for treating a mammal 
having bladder cancer comprising administering a compound identified by the assay 
described herein. 

hi another aspect, the present invention provides a pharmaceutical composition for 
20 treating a mammal having bladder cancer, the composition comprising a compound identified 
by the assay described herein and a physiologically acceptable excipient. 

In one aspect, the present invention provides a method of screening daig candidates 
by providing a cell expressing a gene that is up- and down-regulated as in a bladder cancer. 
In one embodiment, a gene is selected from Tables 1 A-13. The method further includes 
25 adding a drug candidate to the cell and determining the effect of the drug candidate on the 
expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes comparing the 
level of expression in the absence of the drug candidate to the level of expression in the 
presence of the drug candidate, wherein the concentration of the drug candidate can vary 
30 when present, and wherein the comparison can occur after addition or removal of the drug 

6 
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candidate. In a preferred embodiment, the cell expresses at least two expression profile 
genes. The profile genes may show an increase or decrease. 

Also provided is a method of evaluating the effect of a candidate bladder cancer drug 
comprising administering the drug to a transgenic animal expressing or over-expressing the 
5 bladder cancer modulatory protein, or an animal lacking the bladder cancer modulatory 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid segments 
of Tables 1 A-13, wherein tlie biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
1 0 nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with bladder cancer is 
provided. The method comprises determining the expression of a gene of Tables 1 A- 13 in a 
first tissue type of a first individual, and comparing the distribution to the expression of the 
gene from a second normal tissue type firom the first individual or a second imaffected 
1 5 mdividual. A difference in the expression indicates that the first mdividual lias a disorder 
associated with bladder cancer. 

In a further embodiment, tlie biochip also includes a polynucleotide sequence of a 
gene that is not up- and down-regulated in bladder cancer. 

In one embodiment a method for screening for a bioactive agent capable of interfering 
20 with the binding of a bladder cancer modulating protein (bladder cancer modulatory protein) 
or a fragment thereof and an antibody which binds to said bladder cancer modulatory protein 
or fragment thereof. In a preferred embodiment, the method comprises combining a bladder 
cancer modulatory protein or fragment thereof, a candidate bioactive agent and an antibody 
which binds to said bladder cancer modulatory protein or fragment thereof The method 
25 further includes detemiining the binding of said bladder cancer modulatory protein or 
fragment thereof and said antibody. Wherem there is a change in binding, an agent is 
identified as an interfering agent. The interfering agent can be an agonist or an antagonist. 
Preferably, the agent inhibits bladder cancer. 

Also provided herein are methods of ehciting an immune response in an mdividual. 
30 In one embodiment a method provided herein comprises administering to an individual a 
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composition comprising a bladder cancer modulating protein, or a fragment thereof. In 
another embodiment, the protein is encoded by a nucleic acid selected from those of Tables 
lA-13. 

Further provided herein are compositions capable of ehciting an immune response in 
5 an individual. In one embodiment, a composition provided herein comprises a bladder cancer 
modulating protein, preferably encoded by a nucleic acid of Tables lA-13 or a fragment 
thereof, and a pharmaceutically acceptable carrier. In another embodiment, said composition 
comprises a nucleic acid comprising a sequence encoding a bladder cancer modulating 
protein, preferably selected from the nucleic acids of Tables lA-13, and a pharmaceutically 

10 acceptable carrier. 

Also provided are methods of neutralizing the effect of a bladder cancer protein, or a 
fragment thereof, comprising contacting an agent specific for said protein with said protein in 
an amount sufficient to effect neufralization. In another embodiment, the protein is encoded 
by a nucleic acid selected from those of Tables 1 A-13. 

1 5 In another aspect of the invention, a metho d of treating an individual for bladder 

cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a bladder cancer modulating protein. In another embodiment, the 
method comprises administering to a patient having bladder cancer an antibody to a bladder 
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 

20 be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE im^ENTION 
In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and prognosis evaluation for bladder disease (BD), e.g., cancer, 
25 including metastatic bladder cancer, as well as methods for screening for compositions which 
modulate bladder diseases. Also provided are methods and compositions for treating bladder 
disease. Various related conditions where these markers may be usefijl also, include, e.g., 
carcinoma in situ, various stages of papillary carcinomas; and such conditions in different 
stages, layers, structural portions, etc. 

8 
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Recent advances in molecular medicine, generally, have increased the interest in 
tumor-specific cell surface antigens that could serve as diagnostic or prognostic markers, or 
as targets for various immunotherapeutic or small molecule strategies. Antigens suitable for 
immunotherapeutic strategies should be highly expressed in cancer tissues and ideally not 
5 expressed in other, e.g., normal, adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated, as a physiological consequence of such expression would be 
limited. Examples of such antigens in cancers other than bladder cancer include Her2/neu 
and the B-cell antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(HerceptinO/trastuzumab) are currently in use for the treatment of metastatic breast cancer. 
10 Ross and Fletcher (1998) Stem Cells 16:413-428. Similarly, anti-CD20 monoclonal 
antibodies (Rituxin®/rituximab) are used to effectively treat non-Hodglcin's lymphoma. 
Maloney, et al. (1997) Blood 90:2188-2195; and Leget and Czuczman (1998) Curr. Opin. 
Oncol. 10:548-551. 

15 Definitions 

The term "bladder cancer protein" or "bladder cancer polynucleotide" or "bladder 
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, 
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has 
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, 

20 preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 

sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 
Tables lA-13; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an 
immunogen comprising an amino acid sequence encoded by a nucleotide sequence of or 

25 associated with a gene of Tables 1 A- 1 3, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or 
the complement thereof of Tables lA-13 and conservatively modified variants thereof; or (4) 
have an amino acid sequence that has greater than about 60% amino acid sequence identity, 
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 

30 99%o or greater amino sequence identity, preferably over a region of over a region of at least 
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about 25, 50, 100, 200, 500, 1000, or more amino acid, to an amino acid sequence encoded 
by a nucleotide sequence of or associated with a gene of Tables lA-13. A polynucleotide or 
polypeptide sequence is typically from a mammal including, but not limited to, primate, e.g., 
human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A 
5 "bladder cancer polypeptide" and a "bladder cancer polynucleotide," include both naturally 
occurring or recombinant forms. 

A "full length" bladder cancer protein or nucleic acid refers to a bladder cancer 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type bladder cancer 

10 polynucleotide or polypeptide sequences. The "foil length" may be prior to, or after, various 
stages of splicing, including alternative splicing, or post-translation processing. 

"Biological sample" as used herein is a sample of biological tissue or fluid, e.g., that 
contains nucleic acids or polypeptides of a bladder cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 

15 humans, or rodents, e.g., mice and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections talcen for histologic purposes, 
blood, plasma, serum, sputum, stool, urine, tears, mucus, hair, skin, etc. Biological samples 
also include explants and primary and/or transformed cell cultures derived from patient 
tissues. A biological sample is typically obtained from a eukaryotic organism, most 

20 preferably a mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, 
e.g., guinea pig, rat, or mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 

25 isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention m vivo. Archival tissues, havmg treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic acids 
or polypeptide sequences, refer to two or more sequences or subsequences that are the same 

30 or have a specified percentage of amino acid residues or nucleotides that are the same (e.g., 

10 
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about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned 
for maximum correspondence over a comparison window or designated region) as measured 
using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters 
5 described below, or by manual alignment and visual inspection (see, e.g., NCBI web site 
http://wAVW.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to be 
"substantially identical." This definition also refers to, or may be applied to, the compliment 
of a test sequence. The definition also includes sequences that have deletions and/or 
additions, substitutions, naturally occuiring valiants, e.g., polymorphic or allelic, and man- 

1 0 made variants. As described below, tlie preferred algorithms can account for gaps and the 
■ like. Preferably, identity exists over a region that is at least about 25 amino acids or 
nucleotides in length, or more preferably over a region that is 50-100 amino acids or 
nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, to 

1 5 which test sequences are compared. When using a sequence comparison computer algoritiam, 
test and reference sequences, subsequence coordinates, and sequence algorithm program 
parameters are typically designated. Default or alternative program parameters can be 
selected. The sequence comparison algorithm then calculates the percent sequence identities 
for the test sequences relative to the reference sequence, based on the program parameters. 

20 A "comparison window", as used herein, includes reference to a segment of one of the 

number of contiguous positions selected fi:om the group consisting typically of fi:om about 
20-600, usually about 50-200, more usually about 100-150 in which a sequence may be 
compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally ahgned. Methods of alignment of sequences for comparison are 

25 well-lcaown in the art. Optimal alignment of sequences for comparison can be conducted, by, 
e.g., the local homology algorithm of Smith and Waterman ( 198 1) Adv. Appl. Math. 2:482, 
the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443- 
453, the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l Acad. Sci. 
USA 85:2444-448, computerized implementations of these algorithms (GAP, BESTFIT, 

30 FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
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Group, 575 Science Dr., Madison, WT), or manual alignment and visual inspection (see, e.g., 
Ausubel, et al. (eds. 1995 and supplements) Current Protocols in Molecular Biology 
Lippincott. 

Preferred algorithms suitable for determining percent sequence identity and sequence 
5 similarity include the BLAST and BLAST 2.0 algorithms. See Altschul, et al. (1977) Nuc. 
Acids Res. 25:3389-3402; and Altschul, et al. (1990) IMoLBioL 215:403-410. BLAST and 
BLAST 2.0 are used, with the parameters described herein, to determine percent sequence 
identity for the nucleic acids and proteins of the invention. Software for performing BLAST 
analyses is publicly available through the National Center for Biotechnology Information 

10 (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 

sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
either match or satisfy some positive- valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul, et al., supra). These initial neighborhood word hits act as seeds for 

1 5 initiating searches to find longer HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
Cumulative scores are calculated using, e.g., for nucleotide sequences, the parameters M 
(reward score for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to 

20 calculate the cumulative score. Extension of the word hits in each direction are halted when: 
the cumulative alignment score falls off by the quantity X from its maximum achieved value; 
the cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN 

25 program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an expectation 
(E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength of 3, expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Nat'l Acad. Sci. USA 
89:10915-919) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of 

30 both strands. 

12 
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The BLAST algorithm also performs a statistical analysis of the similarity between 
two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nat'l Acad. Sci. USA 90:5873- 
5787). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an mdication of the probability by which a match between 
5 two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01, and most preferably less than about 0.001. Log values maybe large 
negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc. 

10 An indication that two nucleic acid sequences or polypeptides are substantially 

identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised agamst the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 

15 indication that two nucleic acid sequences are substantially identical is tliat the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to ampUfy the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 

20 expression vector and supports the repHcation or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells maybe 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

25 The terms "isolated," "purified," or "biologically pure" refer to material that is 

substantially or essentially free fi:om components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 

30 preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
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some open reading frames that naturally flanlc the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments typically denotes that a 
nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. Preferably, 
it means that the nucleic acid or protein is at least 85% pure, more preferably at least 95% 
5 pure, and most preferably at least 99% pure. "Purify" or "purification" in other embodiments 
means removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogeneous, e.g., 100% pure. 

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The temis apply to amino acid polymers in which 

10 at least one amino acid residue is an artificial chemical mimetic of a corresponding naturally 
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and a non-naturally occurring amino acid polymer. 

The term "amino acid" embraces naturally occurring or synthetic amino acids, amino 
acid analogs, and amino acid mimetics. Naturally occurring amino acids are those encoded 

15 by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, 
y-carboxyglutamate, and 0-phosphoserine. Amino acid analogs include compounds that 
share a basic chemical structure with a naturally occurring amino acid, e.g., an a carbon that 
is bound to a hydrogen, a carboxyl group, an amino group, or an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionme methyl sulfonium. Such analogs may have 

20 modified R groups (e.g., norleucine) or modified peptide backbones, but share some basic 
chemical structure with a naturally occurring amino acid. Amino acid mimetics include 
chemical compounds that have a stiiicture that is different from the general chemical 
structure of an amino acid, but that fimction similarly to a naturally occurring amino acid. 

Amino acids may be referred to herein by their commonly laiown thixe letter symbols 

25 or by the one-letter symbols recommended by the lUPAC-IUB Biochemical Nomenclature 
Commission. Nucleotides, likewise, may be referred to by their commonly accepted single- 
letter codes. 

"Conservatively modified variants" applies to amino acid or nucleic acid sequences. 
With respect to particular nucleic acid sequences, conservatively modified variants refers to 
30 those nucleic acids which encode identical or essentially identical amino acid sequences, or 

14 
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where the nucleic acid does not encode an amino acid sequence, to essentially identical or 
associated, e.g., naturally contiguous, sequences. Because of the degeneracy of tlie genetic 
code, a large number of functionally identical nucleic acids encode most proteins. For 
instance, the codons GCA, GCC, GCG, and GCU all encode the amino acid alanine. Thus, at 
5 each position where an alanine is specified by a codon, the codon can be altered to another of 
the con-esponding codons described without ahering the encoded polypeptide. Such nucleic 
acid variations are "silent variations," which are one species of conservatively modified 
variations. Each nucleic acid sequence herein which encodes a polypeptide also describes 
silent variations of the nucleic acid. One of skill will recognize that in certain contexts each 

10 codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and 
TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a 
functionally identical molecule. Accordingly, often silent variations of a nucleic acid which 
encodes a polypeptide is implicit in a described sequence with respect to the expression 
product, but not with respect to actual probe sequences. 

15 As to amino acid sequences, one of skill will recognize that individual substitutions, 

deletions, or additions to a nucleic acid, peptide, polypeptide, or protem sequence which 
alters, adds, or deletes a single amino acid or a small percentage of amino acids in the 
encoded sequence is a "conservatively modified variant" where the alteration results in the 
substitution of an amino acid with a chemically similar amino acid. Conservative substitution 

20 tables providing flinctionally similar amino acids are well known in the art. Such 

conservatively modified variants are in addition to and do not exclude polymorphic variants, 
interspecies homologs, and alleles of the invention. Typically conservative substitutions for 
one another include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) 
Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysme (K); 5) Isoleucine (I), Leucine (L), 

25 Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine 
(S), Threonine (T); and 8) Cysteine (C), Methionine (M). See, e.g., Creighton (1984) 
Proteins: Structure and Molecular Properties Freeman. 

Macromolecular structures such as polypeptide structures can be described in terms of 
various levels of organization. See, e.g., Alberts, et al. (eds. 2001) Molecular Biology of the 

30 Cell (4th ed.) Garland; and Cantor and Schimmel (1980) Biophysical Chemistry Part I: Tlie 
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Conformation of Biological Macromolecules Freeman. "Primary structure" refers to the 
amino acid sequence of a particular peptide. "Secondary structure" refers to locally ordered, 
three dimensional structures withm a polypeptide. These structures are commonly known as 
domains, which are portions of a polypeptide that often form a compact unit of the 
5 polypeptide, and are typically about 25-500 ammo acids long. Typical domains are made of 
sections of lesser organization such as stretches of p-sheet and a-lieUces. "Tertiary structure" 
refers to the complete three dimensional structure of a polypeptide monomer. "Quaternary 
structure" refers to the thi-ee dimensional structure formed, usually by the noncovalent 
association of independent tertiary units. Anisotropic terms are also known as energy terms. 

1 0 "Nucleic acid" or "oligonucleotide" or "polynucleotide" or granmiatical equivalents 

used herein means at least two nucleotides covalently linked together. Oligonucleotides are 
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up 
to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers, 
including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. A 

1 5 nucleic acid of the present invention will generally contain phosphodiester bonds. In some 
cases, nucleic acid analogs are included that may have alternate backbones, e.g., 
phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49:1925-963 and references therem; 
Letsinger (1970) J. Org. Chem. 35:3800-803; Sprinzl, et al. (1977) Eur. J. Biochem. 81:579- 
589; Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. (1984) Chem. Lett. 

20 805; Letsinger, et al. (1988) J. Am. Chem. Soc. 1 10:4470-471; and Pauwels, et al. (1986) 
Chemica Scripta 26:141-149); phosphorothioate (Mag, et al. (1991), Nucleic Acids Res. 
19:1437-441; and U.S. Patent No. 5,644,048); phosphorodithioate (Brill, et al. (1989) J.Am. 
Chem. Soc. 1 1 1 :2321-322); 0-methylphophoroamidite linlcages (see Eckstein (1992) 
Oligonucleotides and Analogues: A Practical Approach Oxford Univ. Press); and peptide 

25 nucleic acid backbones and linkages (see Egholm (1992) J. Am. Chem. Soc. 1 14: 1895-897; 
Meier, et al. (1992^ Chem. hit. Ed. Engl. 31:1008-010: Nielsen (1993) Nature 365:566-568; 
Carlssoii, et al. (1996) Nature 380:207. Other analog nucleic acids include those with 
positively charged backbones(Denpcy, et al. (1995) Proc. Nat'l Acad. Sci. USA 92:6097- 
101); non-ionic backbones (U.S. Patent Nos. 5,386,023; 5,637,684; 5,602,240; 5,216,141; 

30 and 4,469,863; Kiedrowshi, et al. (1991) Angew. Chem. Intl. Ed. English 30:423-426; 
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Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-471; Jung, et al. (1994) Nucleoside and 
Nucleotide 13:1597-xxx; Chapters 2-3 of Sanghvi and Cook (eds. 1994) Carbohydrate 
Modifications in Antisense Research ACS Symposium Series 580; Mesmaeker, et al. (1994) 
Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs, et al. (1994) J. Biomolecular NMR 
5 34:17; Horn, et al. (1996) Tetrahedron Lett. 37:743-xxx); and non-ribose backbones (see U.S. 
Patent Nos. 5,235,033 and 5,034,506, and Chapters 6-7 of Sanghvi and Cook (eds. 1994) 
Carbohydrate Modifications in Antisense Research ACS Symposium Series 580. Nucleic 
acids containing one or more carbocyclic sugars are also contemplated. See Jenkins and 
Turner (1995) Chem. Soc. Rev. 24:169-176. Several nucleic acid analogs are described in 

10 Rawls (page 35, June 2, 1997) C&E News . Modifications of the ribose-phosphate backbone 
may be made, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and 
analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures 
of naturally occurring nucleic adds and analogs may be made. 

1 5 Particularly preferred are peptide nucleic acids (PNA) which include peptide nucleic 

acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
The PNA backbone typically exhibits improved hybridization kinetics, exhibiting larger 
changes in the melting temperature (Tj^) for mismatched versus perfectly matched basepairs. 

20 DNA and RNA typically exhibit a 2-4° C drop in for an internal mismatch. With the 
non-ionic PNA backbone, the drop is closer to 7-9° C. And due to their non-ionic nature, 
hybridization of the polymers is relatively insensitive to salt concentration, hi addition, 
PNAs are not as easily degraded by cellular enzymes, and can be more stable. 

The nucleic acids may be single stranded or double stranded, or contain portions of 

25 both double stranded or single stranded sequence. As will be appreciated by those in the art, 
the depiction of a single strand also defines the sequence of the complementary strand; thus 
the sequences described herem also provide the complement of the sequence. The nucleic 
acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may 
contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases, 

30 including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine, 
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isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally occurring RNA, e.g., 
a pre-mRNA, hnRNA, or mRNA. As used herein, the term "nucleoside" includes nucleotides 
and nucleoside and nucleotide analogs, and modified nucleosides such as amino modified 
nucleosides. In addition, "nucleoside" includes non-naturally occurring analog structures. 
5 Thus, e.g., the individual units of a peptide nucleic acid, each containing a base, are referred 
to herein as a nucleoside. 

A "label" or "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, chemical, or other physical means. Direct or 
indirect methods are comtemplated. For example, useful labels include ^^P, fluorescent 

10 dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, 

digoxigenin, or haptens and proteins or other entities which can be made detectable, e.g., by 
incorporating a radiolabel into the peptide or used to detect antibodies specifically reactive 
with the peptide. The labels may be incorporated into the bladder cancer nucleic acids, 
proteins, and antibodies. Methods are well known for conjugating the antibody to the label, 

1 5 including those methods described by Hunter, et al. (1962) Nature 144:945-946; David, et al. 
(1974)' Biochemistry 13:1014-021; Pam, et al. (1981) J. Immunol. Meth. 40:219-230; and 
Nygren (1982) J. Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "effector component" is a molecule that is bound 
(or linked, or conjugated), either covalently, tlirough a linker or a chemical bond, or 

20 noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to a target, 
e.g., an antibody. The "effector" can be a variety of molecules including, e.g., detection 
moieties including radioactive compounds; fluorescent compounds; an enzyme or substrate; 
tags such as epitope tags; a toxin; activatable moieties; a chemotherapeutic agent; a lipase; an 
antibiotic; a radioisotope emitting "hard", e.g., beta radiation; or an attracting moiety. 

25 A "labeled nucleic acid probe or oUgonucleotide" is one that is bound, either 

covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe maybe 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
using high affinity interactions may achieve the same results where one of a pair of binding 

30 partners binds to the other, e.g., biotin, streptavidin. 
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As used herein a "nucleic acid probe or oligonucleotide" is defined as a nucleic acid 
capable of binding to a target nucleic acid of complementary sequence through one or more 
types of chemical bonds, usually through complementary base pairing, usually through 
hydrogen bond formation. As used herein, a probe may include natural (e.g., A, G, C, or T) 
5 or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be 
joined by a linkage other than a phosphodiester bond, so long as it does not functionally 
interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. Probes 
may bind target sequences lacking complete complementarity with the probe sequence 

10 depending upon the stringency of the hybridization conditions. The probes are preferably 
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
labeled such as with biotin to which streptavidin linked label may bind. By assaying for the 
presence or absence of the probe, one can detect the presence or absence of the select 
sequence or subsequence. Diagnosis or prognosis may be based at the genomic level, or at 

1 5 the level of RNA or protein expression. 

Tlie term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived firam a cell so modified. Thus, e.g., recombinant 

20 cells express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnomally expressed, under expressed, or not 
expressed at all. By the terai "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this manner, 

25 operable linlcage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and remtroduced into a host cell or 
organism, it will replicate non-recombinantly, e.g., using in vivo cellular machinery of the 

30 host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
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recombinantly, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, e.g., through the expression of a recombinant nucleic 
acid as depicted above, 

5 The term "heterologous" when used with reference to portions of a nucleic acid 

indicates that the nucleic acid comprises two or more subsequences that are not normally 

found in the same relationship to each other in nature. For instance, the nucleic acid is 
tj^ically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
1 0 coding region from another source. Similarly, a heterologous protein -will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 

1 5 sequences near the start site of transcription, such as, in the case of a polymerase 11 type 

promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

20 active under environmental or developmental regulation. The term "operably linked" refers 
to a fimctional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

25 An "expression vector" is a nucleic acid construct, generated recombinantly or 

sjmthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
franscribed operably linked to a promoter. 
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The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under which a 

probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 
acids, but to no other sequences. Stringent conditions are sequence-dependent and will be 
different in different circiunstances. Longer sequences hybridize specifically at higher 
temperatures. An extensive guide to the hybridization of nucleic acids is found in "Overview 

1 0 of principles of hybridization and the strategy of nucleic acid assays" in Tijssen (1993) 

Hybridization with Nucleic Probes (Techniques in Biochemistry and Molecular Biology; vol. 
24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength pH. The Tj^ 
is the temperature (imder defined ionic strength, pH, and nucleic acid concentration) at which 

1 5 50% of the probes complementary to the target hybridize to the target sequence at 

equilibrium (as the target sequences are present in excess, at T^-,, 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1 .0 M sodium ion concentration (or 
other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for short probes (e.g., 

20 about 10-50 nucleotides) and at least about 60° C for long probes (e.g., greater tlian about 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is at least 
about two times background, preferably about 10 times background hybridization. 
Exemplary stringent hybridization conditions can be as following: 50% formamide, 5x SSC, 

25 and 1% SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 
0.2x SSC, and 0.1% SDS at 65° C. For PGR, a temperature of about 36° C is typical for low 
stringency ampHfication, although annealing temperatures may vary between about 32-48° C 
depending on primer length. For high stringency PGR amplification, a temperature of about 
62° G is typical, although high stringency annealing temperatures can range firom about 50- 
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65° C, depending on the primer length and specificity. Typical cycle conditions for both high 
and low stringency amplifications include a denaturation phase of 90-95" C for 30-120 sec, 
an annealing phase lasting 30-120 sec, and an extension phase of about 72° C for 1-2 min. 
Protocols and guidelines for low and high stringency amplification reactions are provided, 
5 e.g., in Innis, et al. (1990) PGR Protocols. A Guide to Methods and Applications Academic 
Press NY. 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 

1 0 permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 
moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37° C, and a wash in IX SSC at 45° C. A positive hybridization is at least about 
twice background. Alternative hybridization and wash conditions can be utilized to provide 

15 conditions of similar stringency. Additional guidelines for determining hybridization 

parameters are provided in numerous references, e.g.,. Ausubel, et al. Current Protocols in 
Molecular Biology LippincotL 

The phrase "functional effects" in the context of assays for testing compounds that 
modulate activity of a bladder cancer protein includes the determination of a parameter that is 

20 indirectly or directly under the influence of the bladder cancer protein or nucleic acid, e.g., a 
functional, physical, or chemical effect, such as the ability to decrease bladder cancer. It 
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cell viability, cellular proliferation; cellular 
transformation; growth factor or serum dependence; tiunor specific marker levels; 

25 invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein 

expression in cells undergoing metastasis, and other characteristics of bladder cancer cells. 
"Functional effects" include in vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a 

30 bladder cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. 
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Such functional effects can be measured by many means known to those skilled in the art, 
e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive 
index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the bladder cancer protein; 
5 measuring binding activity or binding assaj's, e.g., binding to antibodies or other Ugands, and 
measuring cellular proliferation or metabolism. Determination of the functional effect of a 
compound on bladder cancer can also be performed using bladder cancer assays, such as, in 
vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact inhibition and 
density limitation of growth; cellular proliferation; cellular transformation; growth factor or 

10 serum dependence; tumor specific marker levels; invasiveness into Matrigel; tumor growth 
and metastasis in vivo; mRNA and protein expression in cells undergoing metastasis, and 
other characteristics of bladder cancer cells. Functional effects can be evaluated by many 
means, e.g., microscopy for quantitative or qualitative measures of alterations in 
morphological features, measurement of changes in RNA or protein levels for bladder 

1 5 cancer-associated sequences, measurement of RNA stability, identification of downstream or 
reporter gene expression (CAT, luciferase, p-gal, GFP, and the like), e.g., via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and Hgand binding assays. 

"Inhibitors", "activators", and "modulators" of bladder cancer polynucleotide and 

20 polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 
compounds identified using in vitro and in vivo assays of bladder cancer polynucleotide and 
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of bladder cancer proteins, e.g., antagonists. Antisense nucleic acids 

25 may seem to inhibit expression and subsequent flmction of the protein. "Activators" are 

compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 
up regulate bladder cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of bladder cancer proteins, e.g., versions with altered activity, 
as well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 

30 chemical molecules and the like. Such assays for inhibitors and activators include, e.g., 
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expressing the bladder cancer protein in vitro, in cells, or cell membranes, applying putative 
modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of bladder cancer can also be identified by incubating 
bladder cancer cells with the test compound and determining increases or decreases in the 
5 expression of 1 or more bladder cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or 
more bladder cancer proteins, such as bladder cancer proteins encoded by the sequences set 
out in Tables lA-13. 

Samples or assays comprising bladder cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inliibitor, 

1 0 activator, or modulator to examine the extent of inhibition. Control samples (untreated with 
inhibitors) are assigned a relative protein activity value of 100%. Inliibition of a polypeptide 
is achieved when the activity value relative to the control is about 80%, preferably about 
50%, more preferably about 25-0%. Activation of a bladder cancer polypeptide is achieved 
when the activity value relative to the control (untreated with activators) is about 110%, more 

1 5 preferably about 150%, more preferably about 200-500% (e.g., two to five fold higher 
relative to the control), more preferably about 1000-3000% higher. 

The phrase "changes in cell growth" refers to a change in cell growth and proliferation 
characteristics in vitro or in vivo, such as cell viability, formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 

20 limitation of growth, loss of growth factor or serum requirements, changes in cell 

morphology, gaining or losing immortalization, gaining or losing timior specific markers, 
ability to fonn or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., pp. 231-241 of Freshney (1994) Culture of Animal 
Cells:A Manual of Basic Technique (3d ed.). 

25 "Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 

30 spontaneously or foUowmg exposure to a carcinogen, thereby mutating an endogenous gene. 
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Transformation is associated Avith phenotypic changes, such as immortalization of cells, 
aberrant growth control, nomnorphological changes, and/or malignancy. See, Freshney 
(2000) Culture of Animal Cells: A Manual of Basic Technique (4th ed.) Wiley-Liss. 
"Antibody" refers to a polypeptide comprising a framework region from an 
5 immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad inmiunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 

1 0 IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul 
(ed. 1999) Fundamental Immunology (4th ed.) Raven. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having one 

1 5 "light" (about 25 kD) and one "heavy" cham (about 50-70 kD). The N-tenninus of each 

chain defines a variable region of about 100 to 1 10 or more amino acids primarily responsible 
for antigen recognition. The terms variable light chain (Vl) and variable heavy chain (Vfj) 
refer to these Ught and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 

20 fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an 

antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab 
which itself is a light chain joined to Vu-Cijl by a disulfide bond. The F(ab)'2 may be 
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby 
converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab 

25 with part of the hinge region. See Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven. 
While various antibody fragments are defined in terms of the digestion of an intact antibody, 
such fragments may be synthesized de novo either chemically or by using recombinant DNA 
methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies, or those synthesized de novo using 
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recombinant DNA methodologies (e.g., single chain Fv) or those identified using phage 
display libraries. See, e.g., McCafferty, et al. (1990) Nature 348:552-554. 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many techniques can be used. See, e.g., Kohler and Milstein (1 975) Nature 
5 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. pp. 77-96 in 
-Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer Therapy Liss; Coligan (1991) 
Current Protocols in Immunologv Lippincott; Harlow and Lane (1988) Antibodies: A 
Laboratory Manual CSH Press; and Goding (1986) Monoclonal Antibodies: Principles and 
Practice (2d ed.) Academic Press. Techniques for the production of single chain antibodies 

10 (U.S. Patent 4,946,778) can be adapted to produce antibodies to polypeptides of this 

invention. Also, transgenic mice, or other organisms such as other mammals, may be used to 
express humanized antibodies. Alternatively, phage display technology can be used to 
identify antibodies and heteromeric Fab fragments that specifically bind to selected antigens. 
See, e.g., McCafferty, et al. (1990) Nature 348:552-554; and Marks, et al. (1992) 

15 Biotechnology 10:779-783. 

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
portion thereof, is altered, replaced, or exchanged so that the antigen binding site (variable 
region) is linked to a constant region of a different or altered class, effector function, and/or 
species, or an entnely different molecule which confers new properties to the chimeric 

20 antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 

region, or a portion thereof, is altered, replaced, or exchanged with a variable region having a 
different or altered antigen specificity. 

Identification of bladder cancer-associated sequences 

25 In one aspect, the expression levels of genes are determined in different patient 

samples for which diagnosis information is desired, to provide expression profiles. An 
expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have a particular gene similarly expressed, the evaluation of a 
number of genes sunultaneously allows the generation of a gene expression profile that is 

30 characteristic of the state of the cell. That is, normal tissue (e.g., normal bladder or other 
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tissue) may be distinguished from cancerous or metastatic cancerous tissue of the bladder, or 
bladder cancer tissue or metastatic bladder cancerous tissue can be compared with tissue 
samples of bladder and other tissues from surviving cancer patients. By comparing 
expression profiles of tissue in known different bladder cancer states, information regarding 
5 which genes are important (including both up- and down-regulation of genes) ia each of these 
states is obtained. 

The identification of sequences that are differentially expressed in bladder cancer 
versus non-bladder cancer tissue allows the use of this information in a number of ways. For 
example, a particular treatment regime may be evaluated: does a chemotherapeutic drug act 

10 to down-regulate bladder cancer, and thus tumor growth or recurrence, in a particular patient; 
or does chemotherapy or radiation therapy induce expression of particular targets. Similarly, 
diagnosis and treatment outcomes may be done or confirmed by comparing patient samples 
with the known expression profiles. Metastatic tissue can also be analyzed to determine the 
stage of bladder cancer in the tissue or origin of a primary tumor. Furthermore, these gene 

1 5 expression profiles (or individual genes) allow screening of drug candidates with an eye to 
mimickmg or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the bladder cancer expression profile. This may be done by making biochips 
comprising sets of important bladder cancer genes, which can then be used m these screens. 
These methods can also be applied on the protein basis; that is, protein expression levels of 

20 the bladder cancer proteins can be evaluated for diagnostic purposes or to screen candidate 
agents. In addition, the bladder cancer nucleic acid sequences can be administered for gene 
therapy purposes, including the administration of antisense or inhibitory nucleic acids, or the 
bladder cancer proteins (including antibodies and other modulators thereof) administered as 
therapeutic drugs. 

25 Thus the present invention provides nucleic acid and protein sequences that are 

differentially expressed in bladder disease or cancer relative to normal tissues and/or non- 
malignant bladder tissue, herein termed "bladder cancer sequences." As outlined below, 
bladder cancer sequences include those that are up-regulated (e.g., expressed at a higher 
level) in bladder cancer, as well as those that are down-regulatfed (e.g., expressed at a lower 

30 level). In a preferred embodiment, the bladder cancer sequences are from humans; however, 
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as will be appreciated by those in the art, bladder cancer sequences from other organisms may 
be useful in animal models of disease and drug evaluation; thus, other bladder cancer 
sequences are provided, from vertebrates, including mammals, including rodents (rats, mice, 
hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 
5 horses, etc.) and pets, e.g., (dogs, cats, etc.). Bladder cancer sequences from other organisms 
may be obtained using the tecliniques outlined below. 

Bladder cancer sequences can include both nucleic acid and amino acid sequences. 
Bladder cancer nucleic acid sequences are useful in a variety of applications, including 
diagnostic applications, which will detect natui-ally occurring nucleic acids, as well as 

1 0 screening applications. Biochips comprising nucleic acid probes or PGR microtiter plates 
with selected probes to the bladder cancer sequences can be generated. 

A bladder cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the bladder cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is 

1 5 generally determined as outlined below, using either homology programs or hybridization 
conditions. 

For identifying bladder cancer-associated sequences, the bladder cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal, non- 
malignant, or cancerous tissues, or tumor tissue samples from patients who have metastatic 

20 disease vs. non metastatic tissue. Other suitable tissue comparisons include comparing 
bladder cancer samples with metastatic cancer samples from other cancers, such as lung, 
bladder, gastrointestinal cancers, ovarian, etc. Samples of different stages of bladder cancer, 
e.g., survivor tissue, drug resistant states, and tissue undergoing metastasis, are applied to 
biochips comprising nucleic acid probes. The samples are first microdissected, if applicable, 

25 and treated for the preparation of roRNA. Suitable biochips are commercially available, e.g., 
from Affymetrix. Gene expression profiles as described herem are generated and the data 
analyzed. 

In one embodiment, genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably normal 
30 bladder, but also includmg, and not limited to lung, heart, brain, liver, bladder, kidney. 
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muscle, colon, small intestine, large intestine, spleen, bone, and placenta. In a preferred 
embodiment, those genes identified during the bladder cancer screen that are e}q)ressed in a 
significant amount in other tissues are removed from the profile, although in some 
embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable 
5 that the target be disease specific, e.g., not be expressed on critical organs. 

In a preferred embodiment, bladder cancer sequences are those that are up-regulated 
in bladder cancer; that is, the expression of these genes is higher in the bladder cancer tissue 
as compared to non-cancerous tissue. "Up-regulation" as used herein often means at least 
about a two-fold change, preferably at least about a three fold change, with at least about 

1 0 five-fold or higher being preferred. Unigene cluster identification numbers and accession 
numbers herein are for the GenBank sequence database and sequences of accession numbers 
are hereby expressly incorporated by reference. GenBank is known in the art, see, e.g., 
Benson, et al. (1998) Nuc. Acids Res. 26:1-7 and http.V/www.ncbi.nlm.nih.gov/. Sequences 
are also available in other databases, e.g., European Molecular Biology Laboratory (EMBL) 

15 and DNA Database of Japan (DDBJ). 

In another preferred embodiment, bladder cancer sequences are those that are down- 
regulated in the bladder cancer; that is, the e}q)ression of these genes is lower in bladder 
cancer tissue as compared to non-cancerous tissue (see, e.g.. Tables 1 A-13). "Down- 
regulation" as used herein often means at least about a two-fold change, preferably at least 

20 about a three fold change, with at least about five-fold or higher being preferred. 

Informatics 

The ability to identify genes that are over or under expressed in bladder cancer can 
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas 

25 of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with bladder cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation. See Anderson (June 1 1-12, 1998) Pharmaceutical Proteomics: 

30 Targets. Mechanism, and Function, paper presented at the IBC Proteomics conference. 
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Coronado, CA. Subcellular toxicological information can also be utilized in a biological 
sensor device to predict the likely toxicological effect of chemical exposures and likely 
tolerable exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue 
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 
5 saccharides, lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that includes 
at least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can 
be maintained and transmitted, but is preferably an electronic database. The electronic 

10 database of the invention can be maintained on an electronic device allowing for the storage 
of and access to the database, such as a personal computer, but is preferably distributed on a 
wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent that similar databases can be assembled for 

1 5 assay data acquhed using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a biological 
sample midergoing bladder cancer, e.g., the identification of bladder cancer-associated 
sequences described herein, provide an abundance of information, which can be correlated 

20 with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linlcages, identification of correlates of immunity and physiological 
status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

25 An array of methods for indexing and retrieving biomolecular information is known 

in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 
system for storing biomolecular sequence information in a manner that allows sequences to 
be catalogued and searched according to one or more protein fimction hierarchies. U.S. 
Patent 5,953,727 discloses a relational database having sequence records contaming 

30 information in a format that allows a collection of partial-length DNA sequences to be 
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catalogued and searched acqording to association with one or more sequencing projects for 
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
5 between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a fiinctionality for multi-dimensional data analysis 

10 described as on-line analytical processing (OLAP), which entails the consolidation of 

projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a. hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 

15 the merger of two or more such tree structures. 

See also Mount, et al. (2001) Bioinformatics CSH Press, NY; Durbin, et al. (eds. 
1999) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids 
Cambridge Univ. Press; Baxevanis and OeuUette (eds. 1998) Bioinformatics: A Practical 
Guide to the Analysis of Genes and Proteins (2d. ed.) Wiley-Liss; Rashidi and Buehler 

20 (1 999) Bioinformatics: Basic Applications in Biological Science andMedicme CRC Press; 
Setubal, et al. (eds 1997) Introduction to Computational Molecular Biology Brooks/Cole: 
Misener and Krawetz (eds. 2000) Bioinformatics: Methods and Protocols Oxford Univ. 
Press; Higgins and Taylor (eds. 2000) Bioinformatics: Sequence, Structure, and Databanks: A 
Practical Approach Oxford Univ. Press; Brown (2001) Bioinformatics: A Biologist's Guide to 

25 Biocomputing and the Internet Eaton Pub.; Han and Kamber (2000) Data Mining: Concepts 
and Techiuques Kaufinami Pub.; and Watemian (1995) Introduction to Computational 
Biology: Maps, Sequences, and Geaomes Chap and Hall. 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
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with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 
is from a control tissue sample known to be free of pathological disorders. In a variation, at 
5 least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or 
another tissue specimen to be analyzed for bladder cancer. In another variation, the assay 
records cross-tabulate one or more of the following parameters for each target species in a 
sample: (1) a unique identification code, which can include, e.g., a target molecular structure 
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample 

1 0 source; and (3) absolute and/or relative quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data 
in a computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and on-CPU 

1 5 data storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor 
and a charge storage area, which may be on the transistor). In one embodiment, the invention 
provides such storage devices, and computer systems built therewith, comprising a bit pattern 

20 encoding a protein expression fmgerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a 
method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 

25 or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 
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The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention 
5 in a file format suitable for retrieval and processing in a computerized sequence analysis, 
comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 
linlced via a data linlc, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN 
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at 
10 least one network device (e.g., computer, disk anay, etc.) comprises a pattern of magnetic 
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) 
composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
15 ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for comparing a 
query target to a database containing an array of data structures, such as an assay result 
20 obtained by the method of the invention, and ranking database targets based on the degree of 
identity and gap weight to the target data. A central processor is preferably initialized to load 
and execute the computer program for aligimient and/or comparison of the assay results. 
Data for a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data from the data 
25 file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 
30 the query target and results are output via an I/O device. For example, a central processor 
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can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, 
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain 
molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a 
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, 
5 SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can 
be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal 
adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O 
device. 

The invention also preferably provides the use of a computer system, such as that 
10 described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 

collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

15 

Characteristics of bladder cancer-associated proteins 

Bladder cancer proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the bladder 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 

20 cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaluig pathways); aberrant expression of such 
proteins often resufts in unregulated or disregulated cellular processes (see, e.g., Alberts, et 
al. (1994) Molecular Biology of the Cell (3d ed.) Garland. For example, many intracellular 
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity, 

25 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 

30 protems of one or more structural motifs for which defined fimctions have been attributed. In 
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addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
5 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of amino acid 

10 sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enz3miatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 

15 Center in England, and the Karolinska Institute in Sweden. See, e.g., Bateman, et al. (2000) 
Nuc. Acids Res . 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al. 
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320- 
322. 

In anotlier embodiment, the bladder cancer sequences are transmembrane proteins. 

20 Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. Tliey may 
have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 

25 domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. 
30 For example, receptor tyrosine kmases, certain cytokine receptors, receptor guanylyl cyclases 

35 



wo U3/003906 



PCT/US02/21338 



and receptor serine/threonine protein kinases contain a single transmembrane domain. 
However, various other proteins including channels and adenylyl cyclases contain numerous 
transmembrane domains. Many important cell surface receptors such as G protein coupled 
receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they contain 
5 7 membrane spamiing regions. Characteristics of transmembrane domains include 

approximately 1 7 consecutive hydrophobic amino acids that may be followed by charged 
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the 
localization and number of transmembrane domains within the protein may be predicted (see, 
e.g., PSORT web site http://psort.nibb.ac.jp/). hnportant transmembrane protein receptors 

1 0 include, but are not limited to the insulin receptor, insulin-like growth factor receptor, human 
growth hormone receptor, glucose transporters, transferrin receptor, epidermal growth factor 
receptor, low density lipoprotein receptor, epidermal growth factor receptor, leptin receptor, 
and interleukin receptors, e.g., IL-1 receptor, JL-2 receptor, etc. 

The extracellular domains of transmembrane proteins are diverse; however, conserved 

1 5 motifs are found repeatedly among various extracellular domains. Conserved structure 
and/or fimctions have been ascribed to different extracellular motifs. Many extracellular 
domains are involved in binding to other molecules. In one aspect, extracellular domains are 
found on receptors. Factors that bind the receptor domam include circulating ligands, which 
may be peptides, proteins, or small molecules such as adenosine and the like, For example, 

20 growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their 
cognate receptors to initiate a variety of cellular responses. Other factors include cytokines, 
mitogenic factors, neurotrophic factors and the like. Extracellular domains also bind to cell- 
associated molecules. In this respect, they mediate cell-cell interactions. Cell-associated 
ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) anchor, or 

25 may themselves be transmembrane proteins. Extracellular domains also associate with the 
extracellular matrix and contribute to the maintenance of the cell structure. 

Bladder cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useflil 

30 in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
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situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeabUzed to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
5 Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the bladder cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins may have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

1 0 proteins are involved in numerous physiological events; e.g., if circulating, they often serve 
to transmit signals to various other cell types. The secreted protein may function in an 
autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting on 
cells in close proximity to the cell that secreted the factor), an endocrine manner (acting on 
cells at a distance, e.g, secretion into the blood stream), or exocrine (secretion, e.g., through a 

1 5 duct or to adjacent epithelial surface as sweat glands, sebaceous glands, pancreatic ducts, 
lacrimal glands, mammary glands, wax producing glands of the ear, etc.). Thus secreted 
molecules often find use in modulating or altering numerous aspects of physiology. Bladder 
cancer proteins that are secreted or released proteins are particularly preferred in the present 
invention as they serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, 

20 or urine tests. Those which are enzymes may be antibody or small molecule targets. Others 
may be useful as vaccine targets, e.g., via CTL mechanisms. 

Use of bladder cancer nucleic acids 

As described above, bladder cancer sequence is initially identified by substantial 
25 nucleic acid and/or amino acid sequence homology or linlcage to the bladder cancer 

sequences outlined herein. Such homology can be based upon the overall nucleic acid or 
amino acid sequence, and is generally determined as outlined below, using either homology 
programs or hybridization conditions. Typically, Unked sequences on a mRNA are found on 
the same molecule. 
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The bladder cancer nucleic acid sequences of the invention, e.g., the sequences in 
Tables lA-13, can be fragments of larger genes, e.g., they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
5 sequences provided herein, extended sequences, in either direction, of the bladder cancer 
genes can be obtained, using techniques well known in the art for cloning either longer 
sequences or the full length sequences; see Ausubel, et al., supra. Much can be done by 
informatics and many sequences can be clustered to include multiple sequences 
corresponding to a single gene, e.g., systems such as UniGene (see, 

1 0 http://www.ncbi.nlm.nih.gov/UniGene/). 

Once a bladder cancer nucleic acid is identified, it can be cloned and, if necessary, its 
constituent parts recombined to form the entire bladder cancer nucleic acid coding regions or 
the entire mRNA sequence. Once isolated from its natural source, e.g., contained within a 
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the 

1 5 recombinant bladder cancer nucleic acid can be fiirther-used as a probe to identify and isolate 
other bladder cancer nucleic acids, e.g., extended coding regions. It can also be used as a 
"precursor" nucleic acid to make modified or variant bladder cancer nucleic acids and 
proteins. 

The bladder cancer nucleic acids of the present invention are used in several ways. In 
20 a first embodiment, nucleic acid probes to the bladder cancer nucleic acids are made and 

attached to biochips to be used in screening and diagnostic methods, as outlined below, or for 
administration, e.g., for gene therapy, vaccine, and/or antisense/inhibition appUcations, 
Alternatively, the bladder cancer nucleic acids that include coding regions of bladder cancer 
proteins can be put into expression vectors for the expression of bladder cancer proteins, 
25 again for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to bladder cancer nucleic acids (both 
the nucleic acid sequences outlined in the tables and/or the complements thereof) are made. 
The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to tlie bladder cancer nucleic acids, e.g., the target sequence (either the target 
30 sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
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hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be a number of base pair 
mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
5 is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

1 0 A nucleic acid probe is generally single stranded but can be partially single and 

partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 

1 5 genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, {om or more probes, with three being preferred, are used to build in a redundancy for a 

20 particular target. The probes can be overlapping (e.g., have some sequence in common), or 
separate. In some cases, PGR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 

25 solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidm to the support and the non-covalent binding of 

30 the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 



wo U3/003906 



PCT/US02/21338 



equivalents herein is meant that the two moieties, the solid support and the probe, are 
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
5 support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways, as will be 
appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
10 the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 
other grammatical equivalents herein is meant a material that can be modified to contain 
discrete individual sites appropriate for the attachment or association of the nucleic acid 
probes and is amenable to at least one detection method. As will be appreciated by those m 

1 5 the art, the number of possible substrates are very large, and include, but are not Umited to, 
glass and modified or fianctionalized glass, plastics (including acrylics, polystyrene and 
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 
polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, siHca or siUca- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 

20 plastics, etc. hi general, the substrates allow optical detection and do not appreciably 
fluoresce. See WO 00/55627. 

Generally the substrate is planar, although as will be appreciated by those in the art, 
other configurations of substrates may be used as well. For example, the probes may be 
placed on the inside surface of a tube, for flow-through sample analysis to minimize sample 

25 volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed 
cell foams made of particular plastics. 

hi a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical fimctional groups for subsequent attachment of the two. Thus, e.g., 
the biochip is derivatized with a chemical functional group uicluding, but not limited to, 

30 amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
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particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to sxurfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 
homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company 
5 catalog, technical section on cross-liiil<ers, pages 155-200). hi addition, in some cases, 

additional linlcers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, and then 
attached to the surface of the solid support. As will be appreciated by those skilled in the art, 
1 0 either the 5 ' or 3 ' terminus may be attached to the sohd support, or attachment may be via an 
internal nucleoside. 

hi another embodiment, the immobilization to the solid support may be very strong, 
yet non-covalent. For example, biotinylated ohgonucleotides can be made, which bind to 
surfaces covalently coated with streptavidin, resulting in attachment. 

1 5 Altematively, the oligonucleotides may be synthesized on the surface, as is known in 

the art. For example, photoactivation techniques utilizing photopolymerization compounds 
and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in 
situ, using well known photolithographic techniques, such as those described in WO 
95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited 

2 0 within, all of which are expressly incorporated by reference; these methods of attachment 
form the basis of the Affimetrix GeneChip''''^ technology. 

Often, amplification-based assays are performed to measure the expression level of 
bladder cancer-associated sequences. These assays are typically performed in conjunction 
with reverse transcription, hi such assays, a bladder cancer-associated nucleic acid sequence 

25 acts as a template in an amphfication reaction (e.g.. Polymerase Chain Reaction, or PGR), hi 
a quantitative amplification, the amount of amplification product will be proportional to the 
amount of template in the original sample. Comparison to appropriate controls provides a 
measure of the amount of bladder cancer-associated RNA. Methods of quantitative 
amplification are well known to those of skill in the art. Detailed protocols for quantitative 
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PGR are provided, e.g., in Innis, et al. (1990) PGR Protocols: A Guide to Methods and 
Applications Academic Press. 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
5 dye and a 3' quenching agent. The probe hybridizes to a PGR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PGR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
quenching agent, thereby resulting in an increase in fluorescence as a function of 

10 amplification. See, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com. 
Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LGR) (see Wu and Wallace (1989) Genomics 4:560-569; Landegren, et al. (1988) 
Science 241:1077-1080; and Baninger, et al. (1990) Gene 89:117-122), transcription 
amplification (Kwoh, et al. (1989) Proc. Nat'l Acad. Sci. USA 86:1173-1177), self-sustained 

15 sequence replication (Guatelli, et al. (1990) Proc. Nat'l Acad. Sci. USA 87:1874-1878), dot 
PGR, and linker adapter PGR, etc. 

Expression of bladder cancer proteins from nucleic acids 

In a preferred embodiment, bladder cancer nucleic acids, e.g., encoding bladder 
20 cancer proteins, are used to make a variety of expression vectors to express bladder cancer 
proteins which can then be used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in the art (see, e.g., 
Ausubel, supra, and Fernandez and Hoeffler (eds. 1999) Gene Expression Systems Academic 
Press) and are used to express proteins. The expression vectors may be either self-replicating 
25 extracluomosomal vectors or vectors which integrate into a host genome. Generally, these 
expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the bladder cancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Gontrol sequences that are suitable for prokaryotes, e.g., include a 
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promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a fimctional relationship with 
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 
5 operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; a ribosome binding site is operably 
linked to a coding sequence if it is positioned so as to facilitate translation; two sequences 
may be operably linked if they ai e physically linlced on a single polynucleotide. Generally, 

1 0 "operably linlced" means that the DNA sequences being linked are contiguous, and, in the 
case of a secretory leader, contiguous and in reading phase. However, enliancers do not have 
to be contiguous. Linlcing is typically accomplished by ligation at convenient restriction 
sites. If such sites do not exist, synthetic oUgonucleotide adaptors or hnkers are used in 
accordance with conventional practice. Transcriptional and translational regulatory nucleic 

1 5 acid will generally be appropriate to the host cell used to express the bladder cancer protein. 
Numerous types of appropriate expression vectors, and suitable regulatory sequences are 
known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 

20 sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The promoters 
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
25 combine elements of more than one promoter, are useful in the present invention. 

In addition, an expression vector may comprise additional elements. The expression 
vector may have two replication systems, thus allowing it to be maintained in two organisms, 
e.g., in mammalian or insect cells for expression and in a procaryotic host for cloning and 
rephcation. For integrating expression vectors, the expression vector may contain at least one 
30 sequence homologous to the host cell genome, and preferably two homologous sequences 
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which flank the expression construct. The integrating vector may be directed to a specific 
locus in the host cell by selecting the appropriate homologous sequence for inclusion in the 
vector. Constructs for integrating vectors are well known (e.g., Fernandez and Hoeffler, 
supra). 

5 In addition, in a preferred embodiment, the expression vector contains a selectable 

marker gene to allow the selection of transformed host cells. Selection genes are well known 
in the art and will vary with the host cell used. 

The bladder cancer proteins of the present invention may be produced by culturing a 
host cell transformed with an expression vector under the appropriate condhions to induce or 

1 0 cause expression of the bladder cancer protem. Conditions appropriate for bladder cancer 
protein expression will vary with the choice of the expression vector and the host cell, and 
will be easily ascertained by one skilled in the art through routine experimentation or 
optimization. For example, the use of constitutive promoters in the expression vector will 
typically require optimizing the growth and proliferation of the host cell, while the use of an 

1 5 inducible promoter typically requires identifying the appropriate growth conditions for 
induction. Li addition, in some embodiments, the timing of the harvest is important. For 
example, the baculoviral systems used in msect cell expression are lytic viruses, and harvest 
time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fimgi, insect, and 

20 animal cells, including mammahan cells. Of particular interest are Saccharomyces cerevisiae 
and other yeasts, E. coli. Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THPl cells (a 
macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the bladder cancer proteins are expressed in mammalian 

25 cells. Mammalian expression systems include retroviral and adenoviral systems. Retroviral 
vector systems are described in PCT/US97/01019 and PCT/US97/01048. Of particular use 
are promoters from mammalian viral genes, since viral genes are often highly expressed and 
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor 
vims LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 

30 CMVpromoter(see, e.g., Fernandez and Hoeffler, supra). Typically, transcription 
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termination and polyadenylation sequences recognized by mammalian cells are regulatory 
regions located 3' to the translation stop codon and thus, together with the promoter elements, 
flank the coding sequence. Examples of transcription terminator and polyadenlyation signals 
include those derived from SV40. 
5 Methods of introducing exogenous nucleic acid into mammalian and other hosts are 

well loiown, and will vary with the host cell used. Techniques mclude dextran-mediated 
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast 
fusion, electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes, 
and direct microinjection of the DNA into nuclei. 

10 In another embodiment, bladder cancer proteins are expressed in bacterial systems. 

Promoters from bacteriophage may also be used. Synthetic promoters and hybrid promoters 
are also useful; e.g., the tac promoter is a hybrid of the trp and lac promoter sequences. A 
bacterial promoter can include naturally occurring promoters of non-bacterial origin that have 
the ability to bind bacterial RNA polymerase and initiate transcription. Often an efficient 

1 5 ribosome binding site is desirable. The expression vector may include a signal peptide 
sequence that provides for secretion of the bladder cancer protein. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may include a selectable marker gene to allow for the selection of bacterial 

20 strains that have been transformed. Suitable selection genes mclude genes which render the 
bacteria resistant to drugs, e.g., ampicillin, chloramphenicol, erythromycin, kanamycin, 
neomycin, and tetracycline, or biosynthetic genes, e.g., those in the histidine, tryptophan, and 
leucine biosynthetic pathways. These components are assembled into expression vectors. 
Expression vectors for bacteria include vectors for Bacillus subtilis, E. coli, Streptococcus 

25 cremoris, and Streptococcus Uvidans, among others (e.g., Fernandez and Hoeffler, supra). 
The bacterial expression vectors are transformed into bacterial host cells using, e.g., calcium 
chloride freatment, electroporation, and other methods. 

Bladder cancer proteins can also be produced in insect cells. See, e.g., Miller, et al. 
(1 997) Baculovirus Expression Vectors: A Laboratory Manual Oxford Books; ISBN: 

30 0716770172; and Makrides (1999) Prot. Expr. Purif 17:183-202. 
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Bladder caiicer protein may be produced in yeast cells. Yeast expression systems 
exist with expression vectors for Saccharomyces cerevisiae, Candida albicans and C. maltosa, 
Hansenulapolymorpha, Kluyveromyces fragilis andK. lactis, Pichia guillerimondii and P. 
pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica. See, e.g., Jones, et al. (eds. 
5 1993) The Molecular and Cellular Biology of the Yeast Saccharomyces: Gene Expression 
CSH Press; ISBN: 0879693657. 

The bladder cancer protein may also be made as a fusion protein, using techniques 
well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired 
epitope is small, the bladder cancer protein may be fused to a carrier protein to form an 

1 0 immunogen, Alternatively, the bladder cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the bladder cancer protein is a 
bladder cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression or purification purposes. 

The bladder cancer protein is typically purified or isolated after expression. Bladder 

1 5 cancer proteins may be isolated or purified in a variety of ways, depending on what other 

components are present in the sample. Standard purification methods include electrophoretic, 
molecular, immunological, and chromatographic techniques, including ion exchange, 
hydrophobic, affmity, reverse-phase HPLC chromatography, and chromatofocusing. The 
bladder cancer protein may be purified using a standard anti-bladder cancer protein antibody 

20 affmity colunm. Ultrafiltration and diafiltration techniques, in conjunction with protein 

concentration, are also useful. For general guidance in suitable purification techniques, see, 
e.g.. Scopes (1982) Protein Purification Springer-Verlag. The degree of purification 
necessary will vary depending on the use of the bladder cancer protein. In some instances no 
purification will be necessary, which may depend on the intended use. 

2 5 Once expressed and purified, if necessary, the bladder cancer proteins and nucleic 

acids are useful in a number of applications. They may be used as immunoselection reagents, 
as vaccine reagents, as screening agents, etc. 

Variants of bladder cancer proteins 
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In one embodiment, the bladder cancer proteins are derivative or variant bladder 
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below, 
the derivative bladder cancer peptide will often contain at least one amino acid substitution, 
deletion, or insertion, with amino acid substitutions being particularly preferred. The amino 
5 acid substitution, insertion, or deletion may occur at most residues witliin the bladder cancer 
peptide. 

Certain embodiments of bladder cancer proteins of the present invention are amino 
acid sequence variants. These variants typically fall into one or more of three classes: 
substitutional, insertional, or deletional variants. These variants ordinarily are prepared by 

1 0 site specific mutagenesis of nucleotides in the DNA encoding the bladder cancer protein, 
using cassette or PGR, mutagenesis, or other techniques well known in the ait, to produce 
DNA encodmg the variant, and thereafter expressing the DNA in recombinant cell culture as 
outlined above. However, variant bladder cancer protein fragments having up to about 100- 
150 residues may be prepared by in vitro synthesis using estabUshed techniques. Amino acid 

1 5 sequence variants are often characterized by the predetermined nature of the variation, a 
feature that sets them apart from naturally occurring allelic or interspecies variation of the 
bladder cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, altliough variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

20 While the site or region for introducing an amino acid sequence variation is often 

predetermined, the mutation per se need not be predetermined. To optimize the performance 
of a mutation at a given site, random mutagenesis may be conducted at the target codon or 
region and the expressed bladder cancer variants screened for the optimal combination of 
desired activities. Techniques for making substitution mutations at predetermined sites in 

25 DNA having a known sequence are well known, e.g., M13 primer mutagenesis and PGR 
mutagenesis. Screening of the mutants is performed using assays of bladder cancer protein 
activities. 

Amino acid substitutions are typically of single residues; insertions usually will be on 
the order of from about 1 to 20 amino acids, although considerably larger insertions may be 
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tolerated. Deletions range from about 1-20 residues, although in some cases deletions may be 
much larger. 

Substitutions, deletions, insertions, or combinations thereof may be used to arrive at a 
final derivative. Generally these changes are done on a few amino acids to minimize the 
5 alteration of the molecule. However, larger changes may be tolerated in certain 

circumstances. When small alterations in the characteristics of the bladder cancer protein are 
desired, substitutions are generally made in accordance with the amino acid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and elicit the 

1 0 same immune response as the naturally-occuiTing analog, although variants also are selected 
to modify the characteristics of the bladder cancer proteins as needed. Alternatively, the 
variant may be designed such that the biological activity of the bladder cancer protein is 
altered. For example, glycosylation sites may be altered or removed. 

Substantial changes in function or immunological identity are made by selecting 

1 5 substitutions that are less conservative than those described above. Substitutions may be 

made which more significantly affect: the structure of the polypeptide backbone in the area of 
the alteration, e.g., the alpha-helical or beta-sheet structure; the charge or hydrophobicity of 
the molecule at the target site; or the bulk of the side chain. Substitutions which are expected 
to produce the greatest changes in the polypeptide's properties are those in which (a) a 

20 hydrophiUc residue, e.g., serine or threonine is substituted for (or by) a hydrophobic residue, 
e.g., leucine, isoleucine, phenylalanine, valine, or alanine; (b) a cysteine or proline is 
substituted for (or by) another residue; (c) a residue having an electropositive side chain, e.g., 
lysine, arginine, or histidine, is substituted for (or by) an electronegative residue, e.g., 
glutamic acid or aspartic acid; or (d) a residue having a bulky side chain, e.g., phenylalanine, 

25 is substituted for (or by) one not having a side chain, e.g., glycine. 

Covalent modifications of bladder cancer polypeptides are included within the scope 
of this invention. One type of covalent modification includes reacting targeted amino acid 
residues of a bladder cancer polypeptide with an organic derivatizing agent that is capable of 
reacting with selected side chains or the N-or C-terminal residues of a bladder cancer 

30 polypeptide. Derivatization with biflinctional agents is useful, e.g., for crosslinking bladder 
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cancer polypeptides to a water-insoluble support matrix or surface for use in the method for 
purifying anti-bladder cancer polypeptide antibodies or screening assays. Commonly used 
crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic acid, homobifunctional 
5 imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), 
bifunctional maleimides such as bis-N-maIeimido-l,8-octane and agents such as methyl-3- 
((p-azidophenyl)dithio)propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to 
the corresponding glutamic and aspartic residues, respectively, hydroxylation of proline and 

1 0 lysine, phosphorylation of hydroxyl groups of serine, threonine or tyrosine residues, 

methylation of the amino groups of the lysine, arginine, and histidine side chains (pp. 79-86, 
Creighton (1984) Proteins: Structure and Molecular Properties Freeman), acetylation of the 
N-termiaal amine, and amidation of a C-terminal carboxyl group. 

Another type of covalent modification of the bladder cancer polypeptide included 

1 5 within the scope of this invention comprises altering the native glycosylation pattern of the 
polypeptide. "Altermg the native glycosylation pattern" is intended for purposes herein to 
mean deleting one or more carbohydrate moieties found in native sequence bladder cancer 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence bladder cancer polypeptide. Glycosylation patterns can be altered in many ways. 

20 For example the use of different cell types to express bladder cancer-associated sequences 
can result in different glycosylation patterns. 

Addition of glycosylation sites to bladder cancer polypeptides may also be 
accomplished by-altering the amino acid sequence thereof The alteration may be made, e.g., 
by the addition of, or substitution by, one or more serine or threonine residues to the native 

25 sequence bladder cancer polypeptide (for O-linked glycosylation sites). The bladder cancer 
amino acid sequence may optionally be altered through changes at the DNA level, 
particularly by mutating the DNA encoding the bladder cancer polypeptide at preselected 
bases such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the bladder 

30 cancer polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. 
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Such methods are described in the art, e.g., in WO 87/05330, and pp. 259-306 in ApUn and 
Wriston (1981) CRC Crit. Rev. Biochem. 

Removal of carbohydrate moieties present on the bladder cancer polypeptide may be 
accomplished chemically or enzymatically or by mutational substitution of codons encoding 
5 amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 
techniques are laiown in the art. See, e.g., Halcimuddin, at al. (1987) Arch. Biochem. 
Biophvs. 259:52-57; and Edge, et al. (1981) Anal. Biochem. 11 8: 13 1-137. Enzymatic 
cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of 
endo-and exo-glycosidases. See, e.g., Thotalcura, et al. (1987) Meth. Enzymol. 138:350-359. 

1 0 Another type of covalent modification of bladder cancer comprises linlcing the 

bladder cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol, or polyoxyaUcylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192; or 4,179,337. 

Bladder cancer polypeptides of the present invention may also be modified to form 

1 5 chimeric molecules comprising a bladder cancer polypeptide fused to a heterologous 

polypeptide or amino acid sequence. In one embodiment, a chimeric molecule comprises a 
fusion of a bladder cancer polypeptide witli an epitope tag. The epitope tag is generally 
placed at the amino-or carboxyl-terminus of the bladder cancer polypeptide. The presence of 
such epitope-tagged forms of a bladder cancer polypeptide can be detected using an antibody 

20 against the tag polypeptide. Also, provision of the epitope tag enables the bladder cancer 
polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a bladder cancer pol>'peptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 

25 chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the art. 
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 
and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. 
(1988) Mol. Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and 

30 9E10 antibodies thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:361 0-3616); 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. 
(1990) Protein Engineering 3:547-553). Other tag polypeptides include the Flag-peptide 
(Hopp, et al. (1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al. 
(1992) Science 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 
5 266:15163-15166); and theT7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) 
Proc. Nat'l Acad. Sci. USA 87:6393-6397). 

Also included are other bladder cancer proteins of the bladder cancer family, and 
bladder cancer proteins from other organisms, which are cloned and expressed as outhned 
below. Thus, probe or degenerate polymerase chain reaction (PGR) primer sequences may be 

1 0 used to find other related bladder cancer proteins from humans or other organisms. As will 
be appreciated by those in the art, particularly useful probe and/or PGR primer sequences 
include the unique areas of the bladder cancer nucleic acid sequence. Preferred PGR primers 
are from about 15-35 nucleotides in length, with from about 20-30 being preferred, and may 
contain inosine as needed. The conditions for the PGR reaction are well known. See, e.g., 

15 Innis (1990) PGR Protocols, supra. 

Antibodies to bladder cancer proteins 

In a preferred embodiment, when the bladder cancer protein is to be used to generate 
antibodies, e.g., for immunotherapy or inununodiagnosis, the bladder cancer protein should 

20 share at least one epitope br determinant with the full length protein. By "epitope" or 

"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHG. Thus, in most instances, antibodies 
made to a smaller bladder cancer protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 

25 antibodies generated to a unique epitope show Uttle or no cross-reactivity. 

Methods of preparing polyclonal antibodies are known (see, e.g., Cohgan, supra; and 
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 

30 or intraperitoneal injections. The immunizing agent may include a protein encoded by a 



wo U3/003906 



PCT/US02/21338 



nucleic acid of the tables or fragment thereof or a fusion protein thereof It maybe usefUl to 
conjugate the immunizing agent to a protein known to be immunogenic in the mammal being 
immimized. Examples of such immunogenic proteins include but are not limited to keyhole 
limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
5 Examples of adjuvants which may be employed include Freund's complete adjuvant and 
MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicoryiiomycolate). The 
immunization protocol may be selected as appropriate. 

The antibodies may be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein (1975) 

10 Nature 256:495-497. In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immxinized with an immunizing agent to elicit lymphocytes that produce 
or are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 
typically include a polypeptide encoded by a nucleic acid of Tables 1 A-1 3 or fragment 

1 5 thereof, or a ftision protein thereof Generally, either peripheral blood lymphocytes ("PBLs") 
are used if cells of human origin are desired, or spleen cells or lymph node cells are used if 
non-human mammalian sources are desired. The lymphocytes are then fiised with an 
immortahzed cell line using a suitable ftising agent, such as polyethylene glycol, to form a 
hybridoma cell (pp. 59-103, Coding (1986) Monoclonal Antibodies: Principles and Practice 

20 Academic Press). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell Knes are employed. The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inliibit the growth or survival of 
the unfused, immortahzed cells. For example, if the parental cells lack the enzyme 

25 hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
typically monoclonal, preferably human or humanized, antibodies that have binding 

30 specificities for at least two different antigens or that have binding specificities for two 
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epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
protein encoded by a nucleic acid Tables 1 A-13 or a fragment thereof, the other one is for 
another antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
5 multivalent reagents. 

In a preferred embodiment, the antibodies to bladder cancer protein are capable of 
reducing or eliminating a biological function of a bladder cancer protein, as is described 
below. That is, the addition of anti-bladder cancer protein antibodies (either polyclonal or 
preferably monoclonal) to bladder cancer tissue (or cells containing bladder cancer) may 

10 reduce or eUminate the bladder cancer. Generally, at least about 25% decrease in activity, 
growth, size, or the Hke is preferred, with at least about 50% being particularly preferred, and 
about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the bladder cancer proteins are 
humanized antibodies (e.g., Xenerex Biosciences; Medarex, Inc.; Abgenix, Inc.; Protein 

15 Design Labs, Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues from a complementary 

20 determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat, or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 

25 imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 

30 at least a portion of an immunoglobulin constant region (Fc), typically that of a human 



wo U3/003906 



PCT/US02/21338 



immunoglobulin. See Jones, et al. (1986) Nature 321:522-525; Riechmaim, et al. (1988) 
Nature 332:323-329; and Presta (1992) Curr. Op. Struct. Biol. 2:593-596. Humanization can 
be performed, e.g., following the method of Winter and co-workers (see Jones, et al. (1986) 
Nature 321 :522-525; Riechmann, et al. (1988) Nature 332:323-327; Verhoeyen, et al. (1988) 

5 Science 239:1534-1536), by substituting rodent CDRs or CDR sequences for the 

corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,81 6,567), wherein substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

1 0 Human antibodies can also be produced using various techniques known in the art, 

including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381- 
388; Marks, et al, (1991) J. Mol. Biol. 222:581-597) or of human monoclonal antibodies 
(e.g., p, 77, Cole, et al. in Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer 
Therapy Liss; and Boemer, et al. (1991) J. Inamunol. 147:86-95). Similarly, human 

1 5 antibodies can be made by introducing of human immunoglobulin loci into transgenic 

animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or 
completely inactivated. Upon challenge, human antibody production is observed, which 
closely resembles that seen in humans in all respects, including gene rearrangement, 
assembly, and antibody repertoire. This approach is described, e.g., in U.S. Patent Nos. 

20 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and Marks, et al. (1992) 
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) 
Nature 368:812-13; Neuberger (1996) Nature Biotechnology 14:826 commenting on 
Fishwild, et al. (1996) Nature Biotechnology 14:845-51; and Lonberg and Huszar (1995) 
Intern. Rev. Immunol. 13:65-93. 

25 By immunotherapy is meant treatment of bladder cancer with an antibody raised 

against bladder cancer proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
(patient), which may be used to target a label or toxin. Active immunization is the induction 
of antibody and/or T-cell responses in a recipient (patient). Induction of an immune response 

30 is the result of providmg the recipient with an antigen to which antibodies are raised. As 
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appreciated by one of ordinary skill in the art, the antigen may be provided by injecting a 
polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 
expression of the antigen, leading to an immune response. 
5 In a prefen ed embodiment the bladdei- cancer proteins against which antibodies are 

raised are secreted proteins as described above. Without being bound by theory, antibodies 
used for treatment, bind and prevent the secreted protein from binding to its receptor, thereby 
inactivating the secreted bladder cancer protein. 

In another preferred embodiment, the bladder cancer protein to which antibodies are 

10 raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment, bind the extracellular domain of the bladder cancer protein and prevent it from 
binding to other proteins, such as circulating ligands or cell-associated molecules. The 
antibody may cause down-regulation of the fraasmembrane bladder cancer protein. As will 
be appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 

1 5 competitive or uncompetitive inhibitor of protein binding to the exfracellular domain of the 
bladder cancer protein. The antibody is also an antagonist of tlie bladder cancer protein. 
Further, the antibody prevents activation of the transmembrane bladder cancer protein. In 
one aspect, when the antibody prevents the binding of other molecules to the bladder cancer 
protein, the antibody prevents growth of the cell. The antibody may also be used to target or 

20 sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-P, IL-1, INF-y 
and IL-2, or chemo therapeutic agents including 5FU, vinblastine, actinomycin D, cisp latin, 
methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, bladder cancer is 

25 treated by administering to a patient antibodies directed against the transmembrane bladder 
cancer protein. Antibody-labeling may activate a co-toxin, locahze a toxin payload, or 
otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The eflFector moiety can be a number of molecules, including labelling moieties such as 

30 radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the 
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therapeutic moiety is a small molecule that modulates the activity of the bladder cancer 
protein. In another aspect the therapeutic moiety modulates the activity of molecules 
associated with or in close proximity to the bladder cancer protein. The therapeutic moiety 
may inhibit enzj/matic activity such as protease or collagenase or protein kinase activity 
5 associated with bladder cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In 
this method, targeting the cytotoxic agent to bladder cancer tissue or cells, resuhs in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with bladder 
cancer. Cytotoxic agents are numerous and varied and include, but are not limited to, 

10 cytotoxic diTigs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin, and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against bladder cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

1 5 the antibody. Targeting the therapeutic moiety to transmembrane bladder cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the bladder cancer 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

In another preferred embodiment, the bladder cancer protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
the individual or cell. Moreover, wherein the bladder cancer protein can be targeted within a 
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a 

25 nuclear localization signal. 

The bladder cancer antibodies of the invention specifically bind to bladder cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
Kd of at least about 0.1 mM, more usually at least about 1 |jM, preferably at least about 0.1 
pM or better, and most preferably, 0.01 \xM. or better. Selectivity of bindmg is also 

30 important. 
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Detection of bladder cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the bladder cancer phenotype. Expression levels of genes in normal tissue 
5 (e.g., not experiencing bladder cancer) and in bladder cancer tissue (and in some cases, for 
varying severities of bladder cancer that relate to prognosis, as outlined below), or in non- 
malignant disease, are evaluated to provide expression profiles. An expression profile of a 
particular cell state or point of development is essentially a "fingerprint" of the state. While 
two states may have a particular gene similarly expressed, the evaluation of a number of 

10 genes simultaneously allows the generation of a gene expression profile that is reflective of 
the state of the cell. By comparing expression profiles of cells in different states, information 
regarding which genes are important (including both up- and down-regulation of genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed to 
determine whether a tissue sample has the gene expression profile of normal or cancerous 

15 tissue. This will provide for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 

20 normal versus bladder cancer tissue. Genes may be tiimed on or turned off in a particular 
state, relative to another state thus permitting comparison of two or more states. A 
qualitatively regulated gene will exhibit an expression pattern within a state or cell type 
which is detectable by standard techniques. Some genes will be expressed in one state or cell 
type, but not in both. Alternatively, the difference in expression maybe quantitative, e.g., in 

25 that expression is increased or decreased; e.g., gene expression is either upregulated, resulting 
in an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays. See Lockhart (1996) Nature Biotechnology 14:1675-1680. 

30 Other techniques include, but are not limited to, quantitative reverse transcriptase PGR, 
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northern analysis and RNase protection. As outlined above, preferably the change in 
expression (e.g., upregulation or downregulation) is at least about 50%, more preferably at 
least about 100%, more preferably at least about 150%, more preferably at least about 200%, 
with from about 300-1000% being especially preferred. 
5 Evaluation may be at the gene transcript, or the protein level. The amount of gene 

expression may be monitored using nucleic acid probes to the DNA or RNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final 
gene product itself (protein) can be monitored, e.g., with antibodies to the bladder cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 

1 0 spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to bladder 
cancer genes, e.g., those identified as being important in a bladder cancer or disease 
phenotype, can be evaluated in a bladder cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed simultaneously 
on a nimiber of genes. Multiple protein expression monitoring can be performed as well. 

1 5 Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of bladder cancer sequences in a 
particular cell. The assays are further described below in the example. PGR techniques can 
be used to provide gi'eater sensitivity. 

20 In a prefen-ed embodiment nucleic acids encoding the bladder cancer protein are 

detected. Although DNA or RNA encoding the bladder cancer protein may be detected, of 
particular interest are methods wherein an mRNA encoding a bladder cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 

25 oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 

30 permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
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probe for sufficient time to allow the probe to hybridize with the target uiRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example, a 
digoxygeiiin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
bladder cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
5 secondary antibody and developed v^dth nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The bladder cancer proteins, antibodies, nucleic acids, modified proteins and cells 
10 containing bladder cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or conesponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

15 As described and defined herein, bladder cancer proteins, including intracellular, 

transmembrane or secreted proteins, find use as diagnostic or prognostic markers of bladder 
cancer, or to assist in selecting theipay based on expression profile and archival data. 
Detection of these proteins in putative bladder cancer tissue allows for detection or diagnosis 
of bladder cancer. In one embodiment, antibodies are used to detect bladder cancer proteins. 

20 A preferred method separates proteins firom a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the bladder cancer protein is 
detected, e.g., by immunoblotting with antibodies raised against the bladder cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

25 In another preferred method, antibodies to the bladder cancer protein find use in in 

situ imaging teclmiques, e.g., in histology (e.g., Asai (ed. 1993) "Antibodies in Cell Biology" 
Methods in Cell Biology (vol. 37). In this method cells are contacted with firom one to many 
antibodies to the bladder cancer protein(s). Following washing to remove non-specific 
antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 

30 the antibody is detected by incubating with a secondary antibody that contains a detectable 
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label. In another method the primary antibody to the bladder cancer protein(s) contains a 
detectable label, e.g. an en2yme marker that can act on a substrate. In another preferred 
embodiment each one of multiple primary antibodies contains a distinct and detectable label. 
This method finds particular use in simultaneous screening for a plurality of bladder cancer 
5 proteins. As will be appreciated by one of ordinary skill in the art, many other histological 
imaging techniques ai'e also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability 
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) can be used in the method. 

10 In another prefeired embodiment, antibodies find use in diagnosing bladder cancer 

from blood, serum, plasma, stool, urine, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of bladder cancer proteins. 
Antibodies can be used to detect a bladder cancer protein by previously described 
immunoassay techniques including ELISA, immunoblotting (western blotting), 

15 immmoprecipitation, BIACORE technology and the like. Conversely, the presence of 

antibodies may indicate an immune response against an endogenous bladder cancer protein. 

In a preferred embodiment, in situ hybridization of labeled bladder cancer nucleic 
acid probes to tissue arrays is done. For example, arrays of tissue samples, including bladder 
cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) 

20 is tlien performed. When comparing the fingerprints between an individual and a standard, 
the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It 
is further understood that the genes which indicate the diagnosis may differ from those which 
indicate the prognosis and molecular profiling of the condition of the cells may lead to 
distinctions between responsive or refractory conditions or may be predictive of outcomes. 

25 fri a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids, 

modified proteins and cells containing bladder cancer sequences are used in prognosis assays. 
As above, gene expression profiles can be generated that correlate to bladder cancer, clinical, 
pathological, or other information, e.g., in terms of long term prognosis. Again, this may be 
done on either a protein or gene level, with the use of genes being preferred. Single or 

30 multiple genes may be useful in various combinations. As above, bladder cancer probes may 
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be attached to biochips for the detection and quantification of bladder cancer sequences in a 
tissue or patient. The assays proceed as outlined above for diagnosis. PGR methods may 
provide more sensitive and accurate quantification. 

5 Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and antibodies as 
described herein are used in dmg screening assays. The bladder cancer proteins, antibodies, 
nucleic acids, modified proteins and cells containing bladder cancer sequences are used in 
drug screening assays or by evaluating the effect of drug candidates on a "gene expression 

1 0 profile" or expression profile of polypeptides. In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques to 
allow monitoring for expression profile genes after treatment with a candidate agent. See, 
e.g., Zlokamik, et al. (1998) Science 279:84-88; and Held (1996) Genome Res. 6:986-94. 
In a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids, 

1 5 modified proteins and cells containing the native or modified bladder cancer proteins are used 
in screening assays. That is, the present invention provides novel methods for screening for 
compositions which modulate the bladder cancer phenotype or an identified physiological 
fimction of a bladder cancer protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 

20 embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokamik, supra. 

Having identified the differentially expressed genes herein, a variety of assays may be 
executed. In a preferred embodiment, assays may be run on an individual gene or protein 

25 level. That is, having identified a particular gene as up regulated in bladder cancer, test 

compounds can be screened for the ability to modulate gene expression or for binding to the 
bladder cancer protein. "Modulation" thus includes both an increase and a decrease in gene 
expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing bladder cancer, with changes of at least 

30 about 10%, preferably about 50%, more preferably about 100-300%, and in some 
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embodiments about 300-1000% or greater. Thus, if a gene exhibits about 4-fold increase in 
bladder cancer tissue compared to normal tissue, a decrease of about four-fold is often 
desired; similarly, about 10-foId decrease in bladder cancer tissue compared to normal tissue 
often provides a target value of about 10-fold increase in expression to be induced by the test 
5 compound. 

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself can be 
monitored, e.g., tlirough the use of antibodies to the bladder cancer protein and standard 
inamunoassays. Proteomics and separation techniques may also allow quantification of 
10 expression. 

In a preferred embodiment, gene expression or protein monitoring of a number of 
entities, e.g., an expression profile, is monitored simultaneously. Such profiles will typically 
involve a plurality of those entities described herein.. 

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as 
1 5 outlined herein for the detection and quantification of bladder cancer sequences in a 

particular cell. Alternatively, PGR may be used. Thus, a series, e.g., of microtiter plate, may 
be used with dispensed primers in desired wells. A PGR reaction can then be performed and 
analyzed for each well. 

Expression monitoring can be performed to identify compounds that modify the 
20 expression of one or more bladder cancer-associated sequences, e.g., a polynucleotide 

sequence set out inTables lA-13. Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate bladder cancer, modulate bladder cancer proteins, bind to a bladder cancer 
protein, or interfere with the binding of a bladder cancer protein and an antibody or other 
25 binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the bladder cancer phenotype or the expression of a bladder cancer sequence, 
30 e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
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expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a bladdei* cancer phenotype, e.g., to a normal tissue or 
non-malignant fingerprint. In another embodiment, a modulator induced a bladder cancer 
phenotype. Generally, a plurality of assay mixtures are run in parallel with different agent 
5 concentrations to obtain a differential response to the various concentrations. Typically, one 
of these concentrations serves as a negative control, e.g., at zero concentration or below the 
level of detection. 

Drug candidates encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of more 

10 than about 100 and less than about 2,500 daltons. Preferred small molecules are less than 
about 2000, or less than about 1500 or less than about 1000 or less than about 500 D. 
Candidate agents comprise functional groups necessary for structural interaction with 
proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, 
hydroxyl, or carboxyl group, preferably at least two of the functional chemical groups. The 

15 candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above functional groups. 
Candidate agents are also found among biomolecules including peptides, saccharides, fatty 
acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof 
Particularly preferred are peptides. 

20 In one aspect, a modulator will neutralize the effect of a bladder cancer protein. By 

"neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a bladder cancer polypeptide or to modulate activity. 
25 Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compoimd, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. 
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In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
5 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual tlierapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 

10 library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length (e.g., 
the number of amino acids in a polypeptide compound). Millions of chemical compoimds 
can be synthesized through such combinatorial mixing of chemical building blocks. See, e.g., 
Gallop, et al. (19941 J. Med. Chem. 37:1233-1251. 

1 5 Preparation and screening of combinatorial chemical libraries is well known to those 

of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot. Res. 37:487- 
493, Houghton, et al. (1991) Nature 354:84-88); peptoids (PCX Publication No WO 
91/19735); encoded peptides (PCX Publication WO 93/20242); random bio-oligomers (PCX 

20 Publication WO 92/00091); benzodiazepines (U.S. Pat. No. 5,288,514); diversomers such as 
hydantoins, benzodiazepines, and dipeptides (Hobbs, et al. (1993) Proc. Nat'l Acad. Sci. USA 
90:6909-6913); vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc. 
114:6568-6570); nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirsclmiann, et al. (1992) J. Amer. Chem. Soc. 114:9217-9218); analogous organic 

25 syntheses of small compound Hbraries (Chen, et al. (1994) J. Amer. Chem. Soc. 116:2661- 
2662); oligocarbamates (Cho, et al. (1993) Science 261:1303-1305); and/or peptidyl 
phosphonates (Campbell, et al. (1994) J. Org. Chem. 59:658-xxx). See, generally, Gordon, et 
al. (1994) J. Med. Chem. 37:1385-1401, nucleic acid libraries (see, e.g., Strategene, Corp.); 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083); antibody libraries (see, e.g., 

30 Vaughn, et al. (1996) Nature Biotechnology 14:309-314, and PCX/US96/10287); 
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carbohydrate libraries (Liang, et al. (1996) Science 274:1520-1522, and U.S. Patent No, 
5,593,853); and small organic molecule libraries (see, e.g., benzodiazepines, Baum (p. 33, 
Jan 18, 1993) C&ENewsl : isoprenoid (U.S. Patent No. 5,569,588); thiazolidinones and 
metathiazaiiones (U.S. Patent No. 5,549,974); pyrrolidines (U.S. Patent Nos. 5,525,735 and 
5 5,519,134); morpholino compounds (U.S. Patent No. 5,506,337); benzodiazepines (U.S. 
Patent No. 5,288,5 14); and the like. 

Devices for the preparation of combinatorial libraries are commercially available. 
See, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY; Symphony, Rainin, 
Wobuni, MA; 433 A Apphed Biosystems, Foster City, CA; and 9050 Plus, Millipore, 
10 Bedford, MA. 

A number of well known robotic systems have also been developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 
systems utilizing robotic arms (Zymate H, Zymark Corporation, Hopkinton, MA; Orca, 

15 Hewlett-Packard, Palo Alto, CA), which mimic the manual synthetic operations performed by 
a chemist. The above devices are suitable for use with the present invention. The nature and 
implementation of modifications to these devices (if any) so that they can operate as 
discussed herein will be apparent to persons skilled in the relevant art. In addition, numerous 
combinatorial libraries are themselves commercially available (see, e.g., ComGenex, 

20 Princeton, NJ; Asinex, Moscow, Ru; Tripos, Inc., St. Louis, MO; ChemStar, Ltd, Moscow, 
RU; 3D Pharmaceuticals, Exton, PA; Martek Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of bladder cancer gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 

25 polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other properties 
of particular nucleic acids or protein products are well known to those of skiU in the art. 
Similarly, binding assays and reporter gene assays are similarly well known. Thus, e.g., U.S. 
Patent No. 5,559,410 discloses high throughput screening methods for proteins, U.S. Patent 

30 No. 5,585,639 discloses high throughput screening methods for nucleic acid binding (e.g., in 
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arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high throughput methods of 
screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beclcman 
5 Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA; etc.). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a higli degree of flexibility and customization. The manufacturers of such systems provide 

1 0 detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, hgand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or 

1 5 random or directed digests of proteinaceous cellular extracts, may be used. Tn. this way 

Ubraries of proteins may be made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammaUan proteins, 
with the latter being preferred, and human proteins being especially preferred. Particularly 
useful test compound will be directed to the class of protems to which the target belongs, e.g., 

20 substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5-30 amino acids, 
with from about 5-20 amino acids being preferred, and from about 7-15 being particularly 
preferred. The peptides may be digests of naturally occurring proteins as is outlined above, 
random peptides, or "biased" random peptides. By "randomized" or grammatical equivalents 

25 herein is meant that each nucleic acid and peptide consists of essentially random nucleotides 
and amino acids, respectively. Since generally these random peptides (or nucleic acids, 
discussed below) are chemically synthesized, they may incorporate nucleotide or amino acid 
substitutions. The synthetic process can be designed to generate randomized proteins or 
nucleic acids, to allow the formation of all or most of the possible combinations over the 
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length of the sequence, thus forming a library of randomized candidate bioactive 
proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants. In a preferred embodiment, the library is biased. That is, some positions within 
5 the sequence are either held constant, or are selected from a limited number of possibilities. 
For example, in a preferred embodiment, the nucleotides or amino acid residues are 
randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues, 
sterically biased (either small or large) residues, towards the creation of nucleic acid binding 
domains, the creation of cysteines, for cross-linlcing, prolines for SH-3 domains, serines, 

10 threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc. 

Modulators of bladder cancer can also be nucleic acids, as defined above. 
As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids, 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

15 In a preferred embodiment, the candidate compounds are organic chemical moieties, a 

wide variety of which are available in the literature. 

After the candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 

20 the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 

25 chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the 
target sequence's specific binding to a probe. The label also can be an enzyme, such as, 
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate 
substrate produces a product that can be detected. Alternatively, the label can be a labeled 
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 

30 altered by the enzyme. The label also can be a moiety or compound, such as, an epitope tag 
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or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin 
is labeled as described above, thereby, providing a detectable signal for the bound target 
sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct hybridization 
5 assays or can comprise "sandwich assays", which include the use of multiple probes, as is 
generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 
5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 5,594,118, 
5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by reference. In 
this embodiment, in general, the target nucleic acid is prepared as outlined above, and then 

1 0 added to the biochip comprising a plurality of nucleic acid probes, under conditions that 
allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including 
high, moderate and low stringency conditions as outlined above. The assays are generally 
run under stringency conditions which allows formation of the label probe hybridization 

15 complex only in the presence of target. Stringency can be controlled by altering a step 
parameter that is a thermodynamic variable, including, but not limited to, temperature, 
formamide concentration, salt concentration, chaotropic salt concentration pH, organic 
solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 

20 outlined in U.S. Patent No. 5,681 ,697. Thus it may be desirable to perform certain steps at 
higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 
of the reaction may be added simultaneously, or sequentially, in different orders, with 
preferred embodiments outlined below. In addition, the reaction may include a variety of 

25 other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc., 
.which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 
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The assay data are analyzed to deteraiine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the bladder cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
5 particular expression profile, thus preferably generating the associated phenotype. In another 
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 
important in a particular state, screens can be performed to identify modulators that alter 
expression of individual genes. In an another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
1 0 expressed gene. Again, having identified the importance of a gene in a particular state, 

screens are performed to identify agents that bind and/or modulate the biological activity of 
the gene product. 

In addition screens can be done for genes that are induced in response to a candidate 
agent. After identifying a modulator based upon its abiUty to suppress a bladder cancer 

1 5 expression pattem leading to a normal expression pattern, or to modulate a single bladder 
cancer gene expression profile so as to mimic the expression of the gene firom normal tissue, 
a screen as described above can be performed to identify genes that are specifically 
modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated bladder cancer tissue reveals genes that are not expressed in normal tissue 

20 or bladder cancer tissue, but are expressed in agent treated tissue. These agent-specific 

sequences can be identified and used by methods described herein for bladder cancer genes or 
proteins. In particular these sequences and the proteins they encode find use in marking or 
identifying agent treated cells. In addition, antibodies can be raised against the agent induced 
proteins and used to target novel therapeutics to the treated bladder cancer tissue sample. 

25 Thus, in one embodiment, a test compound is administered to a population of bladder 

cancer cells, that have an associated bladder cancer expression profile. By "administration" 
or "contacting" herein is meant that the candidate agent is added to the cells in such a manner 
as to allow the agent to act upon the cell, whether by uptake and intracellular action, or by 
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous 

30 candidate agent (e.g., a peptide) may be put into a viral construct such as an adenoviral or 
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retroviral construct, and added to the cell, such that expression of the peptide agent is 
accomplished, e.g., PCX US97/01019. Regulatable gene therapy systems can also be used. 

Once the test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some 
5 period of time. The cells are then harvested and a new gene expression profile is generated, 
as outlined herein. 

Thus, e.g., bladder cancer or non-malignant tissue may be screened for agents that 
modulate, e.g., induce or suppress the bladder cancer phenotype. A change in at least one 
gene, preferably many, of the expression profile indicates that the agent has an effect on 

10 bladder cancer activity. By defining such a signature for the bladder cancer phenotype, 

screens for new drugs that alter the phenotype can be devised. With this approach, the drug 
target need not be known and need not be represented in the original expression screening 
platform, nor does the level of transcript for the target protein need to change. 

hi a preferred embodiment, as outlined above, screens may be done on individual 

15 genes and gene products (proteins). That is, having identified a particular differentially 
expressed gene as important in a particular state, screening of modulators of either the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "bladder cancer proteins" 
or a "bladder cancer modulatory protein". The bladder cancer modulatory protein may be a 

20 fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
acids of the Tables lA-13. Preferably, the bladder cancer modulatory protein is a fragment. 
In a preferred embodiment, the bladder cancer amino acid sequence which is used to 
determine sequence identity or similarity is encoded by a nucleic acid of Tables lA-13. hi 
another embodiment, the sequences are naturally occurring allelic variants of a protein 

25 encoded by a nucleic acid of Tables 1 A-13. In another embodiment, the sequences are 
sequence variants as further described herein. 

Preferably, the bladder cancer modulatory protein is a fragment of approximately 14 
to 24 amino acids long. More preferably the fragment is a soluble fragment. Preferably, the 
fragment includes a non-fransmembrane region. In a preferred embodiment, the fragment has 
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an N-terminal Cys to aid in solubility. In one embodiment, the C-terminus of the fragment is 
kept as a free acid and the N-terminus is a free amine to aid in coupling, e.g., to cysteine. 

In one embodiment the bladder cancer proteins are conjugated to an immunogenic 
agent as discussed herein. In one embodiment the bladder cancer protein is conjugated to 
5 BSA. 

Measurements of bladder cancer polypeptide activity, or of bladder cancer or the 
bladder cancer phenotype can be performed using a variety of assays. For example, the 
effects of the test compounds upon the function of the bladder cancer polj^eptides can be 
measured by examining parameters described above. A suitable physiological change that 

1 0 affects activity can be used to assess the influence of a test compound on the polypeptides of 
this invention. When the functional consequences are determined using intact cells or 
animals, one can also measure a variety of effects such as, in the case of bladder cancer 
associated with tumors, tumor growth, tumor metastasis, neovascularization, hormone 
release, transcriptional changes to both known and uncharacterized genetic markers (e.g., 

1 5 northern blots), changes in ceU metabolism such as cell growth or pH changes, and changes 
in intracellular second messengers such as cGMP. In the assays of the invention, mammalian 
bladder cancer polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. 
For example, a bladder cancer polypeptide is first contacted with a potential modulator and 

20 incubated for a suitable amount of time, e.g., from 0.5-48 hours. In one embodiment, the 

bladder cancer polypeptide levels are determined in vitro by measuring the level of protein or 
mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELISA and the like with an antibody that selectively binds to the bladder cancer polypeptide 
or a fragment thereof For measurement of mRNA, amplification, e.g., using PGR, LCR, or 

25 hybridization assays, e.g., nortliem hybridization, RNase protection, dot blotting, are 
preferred. The level of protein or niRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Altematively, a reporter gene system can be devised using the bladder cancer protein 

30 promoter operably linked to a reporter gene such as luciferase, green fluorescent protein, 
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CAT, or P-gal. The reporter construct is typically transfected into a cell. After treatment 
with a potential modulator, the amount of reporter gene transcription, translation, or activity 
is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on individual 
5 genes and gene products (proteins). That is, having identified a particular differentially 

expressed gene as important in a particular state, screening of modulators of the expression of 
the gene or the gene product itself can be done. The gene products of differentially expressed 
genes are sometimes referred to herein as "bladder cancer proteins." The bladder cancer 
protein may be a fragment, or alternatively, be the full length protein to a fragment shown 
10 herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
1 5 differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
gene product is used; that is, the gene products of one or more differentially expressed 
nucleic acids are made. For example, antibodies are generated to the protein gene products, 
20 and standard immunoassays are run to determine the amount of protein present. 

Altematively, cells comprising the bladder cancer proteins can be used in the assays. 

Thus, in a prefeixed embodiment, the methods comprise combining a bladder cancer 
protein and a candidate compound, and determining the binding of the compound to the 
bladder cancer protein. Preferred embodiments utilize the human bladder cancer protein, 
25 although other mammalian proteins may also be used, e.g., for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
bladder cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the bladder cancer 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
30 sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be 
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made of a composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of a convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
5 typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 

10 composition and is nondifflisable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the Ugand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess imboimd material is removed by washing. The sample receiving 

1 5 areas may then be blocked through incubation with bovine serum albumin (BS A), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the bladder cancer protein is bound to the support, and a 
test compound is added to the assay. Altematively, tlie candidate agent is bound to the 
support and the bladder cancer protein is added. Novel binding agents include specific 

20 antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 

25 The detei-mination of the binding of the test modulating compound to the bladder 

•cancer protein may be done in a number of ways. In a prefeiTed embodiment, the compound 
is labeled, and binding determined directly, e.g., by attaching all or a portion of the bladder 
cancer protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), 
washing off excess reagent, and determining whether the label is present on the sohd support. 

30 Various blocking and washing steps may be utilized as appropriate. 



wo U3/003906 



PCT/US02/21338 



In some embodiments, only one of the components is labeled, e.g., the proteins (or 
proteinaceous candidate compoimds) can be labeled. Alternatively, more than one 

component can be labeled with different labels, e.g., 125i foj- the proteins and a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
5 useful. 

In one embodiment, the binding of the test compound is determined by competitive 
binding assay. The competitor is a binding moiety Imown to bind to the target molecule (e.g., 
a bladder cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under 
certain circiunstances, there may be competitive binding between the compound and the 

1 0 binding moiety, with the binding moiety displacing the compound, hi one embodiment, the 
test compound is labeled. Either the compound, or the competitor, or both, is added first to 
the protein for a time sufficient to allow binding, if present. Incubations may be performed at 
a temperature which facilitates optimal activity, tj^ically between 4 and 40°C. Incubation 
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 

1 5 between 0. 1 and 1 hour will be sufficient. Excess reagent is generally removed or washed 
away. The second component is then added, and the presence or absence of the labeled 
component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that tlie test compound is binding 

20 to the bladder cancer protein and thus is capable of binding to, and potentially modulating, 
the activity of the bladder cancer protein. In this embodiment, either component can be 
labeled. Thus, e.g., if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Alternatively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 

25 In an alternative embodiment, the test compound is added first, with incubation and 

washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the bladder cancer protein with a higher affinity. Thus, if 
the test compound is labeled, the presence of the label on the support, coupled with a lack of 
competitor binding, may indicate that the test compound is capable of binding to the bladder 

30 cancer protein. 
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In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of tlie bladder cancer proteins. In this 
embodiment, the methods comprise combining a bladder cancer protein and a competitor in a 
first sample. A second sample comprises a test compound, a bladder cancer protein, and a 
5 competitor. The binding of the competitor is detemiined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the bladder cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the bladder cancer protein. 

10 Alternatively, differential screening is used to identify drug candidates that bind to the 

native bladder cancer protein, but cannot bind to modified bladder cancer proteins. The 
structure of the bladder cancer protein may be modeled, and used in rational drug design to 
synthesize agents that interact with that site, Drug candidates that affect the activity of a 
bladder cancer protein are also identified by screening drugs for the ability to either enhance 

15 or reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably control 
and test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and the 

20 amount of bound, generally labeled agent determined. For example, where a radiolabel is 
employed, the samples may be counted in a scintillation counter to deteraiine the amount of 
bound compound. 

A variety of other reagents may be included in the screening assays. These include 
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to 
25 facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
30 compound capable of modulating the activity of a bladder cancer protein. The methods 
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comprise adding a test compound, as defined above, to a cell comprising bladder cancer 
proteins. Many different cell types may be transfected to contain a recombinant nucleic acid 
that encodes a bladder cancer protein. In a preferred embodiment, a library of candidate 
agents are tested on a plurality of cells. 
5 In one aspect, the assays are evaluated in the presence or absence or previous or 

subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
cytokines, growth factors, action potentials, and pharmacological agents including, e.g., 
chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In one 
example, the detemiinations are determined at different stages of the cell cycle process. 

10 In this way, compounds that modulate bladder cancer agents are identified. 

Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the bladder cancer protein. Once identified, similar structures are evaluated to identify 
critical structural feature of the compound. 

hi one embodiment, a method of inhibiting bladder cancer cell division is provided. 

1 5 The method comprises administration of a bladder cancer inhibitor. In another embodiment, 
a method of inhibiting bladder cancer is provided. The method comprises administration of a 
bladder cancer inhibitor. In a further embodiment, methods of treating cells or individuals 
with bladder cancer are provided. The method comprises administration of a bladder cancer 
inhibitor. In one embodiment, a bladder cancer inhibitor is an antibody as discussed above. 

20 In another embodiment, the bladder cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to those of 
skill in the art, as described below. 
Soft agar growth or colonv formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 

25 transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-sohd media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growtli or colony formation in suspension assays can be used to identify 

30 modulators of bladder cancer sequences, which when expressed in host cells, inhibit 
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abnormal cellular proliferation and transformation. A therapeutic compound would reduce or 
eliminate the host cells' ability to grow in stirred suspension culture or suspended in semi- 
solid media, such as semi-soUd or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
5 described in Fresliney ( 1 994) Culture of Animal Cells: A Manual of Basic Technique (3d ed.) 
Wiley- Liss, herein incorporated by reference. See also, the methods section of Garkavtsev, et 
al. (1996), supra, herein incorporated by reference. 
Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 

10 touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the fa:ansfomied cells gi-ow to a 
higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within tiie regular 

1 5 pattern of normal surroimding cells. Alternatively, labeling index with (3H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with (3H)-thyinidine at saturation density is a preferred 

20 method of measuring density limitation of growth. Transfomied host cells are transfected 
with a bladder cancer-associated sequence and are grown for 24 hours at saturation density in 
non-limiting medium conditions. The percentage of cells labeling witii (3H)-thymidine is 
determined autoradiographically. See, Freshney (1994), supra. 
Growth factor or serum dependence 

25 Transformed cells have a lower serum dependence than their normal counterparts 

(see, e.g., Temin (1966) J. Nat'l Cancer hist. 37:167-175; Eagle, et al. (1970) J. Exp. Med. 
131 :836-879; Freshney (1994), supra). This is in part due to release of various growth factors 
by the transformed cells. Growtii factor or serum dependence of transformed host cells can 
be compared with that of control. 

30 Tumor specific markers levels 
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Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells. See, e.g., 
"Angiogenesis, tumor vascularization, and potential interference with tumor growth" pp. 178- 
5 184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum. Similarly, tumor 
angiogenesis factor (TAF) is released at a higher level in tumior cells than their normal 
coimterparts. See, e.g., Folkman (1992) Sem Cancer Biol. 3:89-96. 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. See also, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 

10 Strickland and Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al. (1980) Br. J. Cancer 
42:305-312; GuUino "Angiogenesis, tumor vascularization, and potential interference with 
tumor growth" pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer, Plenum; 
and Freshnev (1985) Anticancer Res . 5:111-130, 
hivasiveness into Matrigel 

15 The degree of invasiveness into Matrigel or some other extracellular matrix 

constituent can be used as an assay to identify compoimds that modulate bladder cancer- 
associated sequences. Tumor cells exhibit a good correlation between maUgnancy and 
invasiveness of cells into Matrigel or some other exfracellular matrix constituent, In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 

20 gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1984), supra, can be used. Briefly, the level of 
invasion of host cells can be measured by using filters coated with Matrigel or some other 
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 
filter, is rated as invasiveness, and rated histologically by number of cells and distance 

25 moved, or by prelabeling the cells with 125i and counting the radioactivity on the distal side 
of the filter or bottom of the dish. See, e.g., Freshney (2000), supra. 
Tumor growth in vivo 

Effects of bladder cancer-associated sequences on cell growth can be tested in 
transgenic or inomune-suppressed mice. Knock-out transgenic mice can be made, in which 

30 the bladder cancer gene is disrupted or in which a bladder cancer gene is inserted. Knock-out 
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transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous bladder cancer gene site in the mouse genome via homologous 
recombination. Such mice can also be made by substituting the endogenous bladder cancer 
gene with a mutated version of the bladder cancer gene, or by mutating the endogenous 
5 bladder cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. By breeding the chimeric 

10 mice it is possible to obtain a new line of mice containing the introduced genetic lesion. See, 
e.g., Capecchi, et al. (1989) Science 244:1288-1292. Chimeric targeted mice can be made. 
See Hogan, et al. (1988) Manipulatmg the Mouse Embryo: A Laboratory Manual, CSH Press; 
and Robertson (ed. 1987) Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach IRL Press, Washington, D.C. 

1 5 Altematively, various immune-suppressed or immune-deficient host animals can be 

used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) L 
Nat'l Cancer Inst. 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J. 
Cancer 41 :52-61) can be used as a host. Transplantable tumor cells (typically about 106 

20 cells) injected into isogenic hosts will produce invasive tumors in a high proportions of cases, 
while normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
expressing a bladder cancer-associated sequences are injected subcutaneously. After a 
suitable length of time, preferably about 4-8 weeks, tumor growth is measured (e.g., by 
volume or by its two largest dimensions) and compared to the control. Tumors that have 

25 statistically significant reduction (using, e.g., Student's T test) are said to have inhibited 
growth. 

Polynucleotide modulators of bladder cancer 

Antisense and RNAi Polynucleotides 
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In certain embodiments, the activity of a bladder cancer-associated protein is down- 
regulated, or entirely inhibited, by the use of antisense polynucleotide, e.g., a nucleic acid 
complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a bladder cancer protein mRNA, or a subsequence thereof 
5 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their 
close homologs. Antisense polynucleotides may also have altered sugar moieties or inter- 

10 sugar linkages. Exeniplai'y among these are the phosphorothioate and other sulfur containing 
species which are known for use in the art. Analogs are comprehended by this invention so 
long as they function effectively to hybridize with the bladder cancer protein mRNA. See, 
e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant means, 

15 or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oUgonucleotides such as 
phosphorothioates and alkylated derivatives is also well known. 

Antisense molecules as used herein include antisense or sense oUgonucleotides. 
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the anti- 

20 sense strand. The antisense and sense ohgonucleotide comprise a single-stranded nucleic 
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for bladder cancer molecules. A preferred antisense molecule is for a 
bladder cancer sequences in Tables lA-13, or for a ligand or activator thereof Antisense or 
sense oUgonucleotides, according to the present invention, comprise a fragment generally at 

25 least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The abiUty to derive 
an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given 
protein is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659-2668; and van der 
Krol, et al. (1988) BioTechniques 6:958-976. 

RNA interference is a mechanism to suppress gene expression m a sequence specific 

30 manner. See, e.g., Brumelkamp, et al. (2002) Sciencexnress (21March2002); Sharp (1999) 
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Genes Dev. 13:139-141; and Cathew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian 
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to 
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 41 1 :494- 
498. The mechanism may be used to downregulate expression levels of identified genes, e.g., 
5 treatment of or vaUdation of relevance to disease 
Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of bladder cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 

10 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes. See, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 
25: 289-317 for a general review of the properties of different ribozymes. 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 

15 5,254,678. Methods of preparing them are well known. See, e.g., WO 94/26877; Ojwang, et 
al. (1993) Proc. Nat'l Acad. Sci. USA 90:6340-6344; Yamada, et al. (1994) Human Gene 
Therapy 1:39-45; Leavitt, et al.(1995) Proc. Nat'l Acad. Sci. USA 92:699-703; Leavitt, et al. 
(1994) Human Gene Therapy 5:1 151-120; and Yamada, et al. (1994) Virology 205: 121-126. 
Polynucleotide modulators of bladder cancer may be introduced into a cell containing 

20 the target nucleotide sequence by formation of a conjugate with a hgand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not Umited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its 

25 corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of bladder 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that the use of antisense molecules or knock out and knock in models may also be 

30 used in screening assays as discussed above, in addition to methods of treatment. 
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Thus, in one embodiment, methods of modulating bladder cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-bladder cancer antibody that reduces or eliminates the biological activity of an 
endogenous bladder cancer protein. Altematively, the methods comprise administering to a 
5 cell or organism a recombinant nucleic acid encoding a bladder cancer protein. This may be 
accomplished in many ways. In a preferred embodiment, e.g., when the bladder cancer 
sequence is down-regulated in bladder cancer, such state may be reversed by increasing the 
amount of bladder cancer gene product in the cell. This can be accomplished, e.g., by 
overexpressing the endogenous bladder cancer gene or administering a gene encoding the 

10 bladder cancer sequence, using known gene- therapy techniques. In a preferred embodiment, 
the gene therapy techniques include the incorporation of the exogenous gene using enhanced 
homologous recombination (EHR), e.g., as described in PCT/US93/03868, hereby 
incorporated by reference in its entirety. Altematively, e.g., when the bladder cancer 
sequence is up-regulated in bladder cancer, the activity of the endogenous bladder cancer 

1 5 gene is decreased, e.g., by the administration of a bladder cancer antisense nucleic acid. 

In one embodiment, the bladder cancer proteins of the present invention may be used 
to generate polyclonal and monoclonal antibodies to bladder cancer proteins. Similarly, the 
bladder cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify bladder cancer 

20 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a bladder cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The bladder cancer 
antibodies may be coupled to standard affinity chromatography columns and used to purify 
bladder cancer proteins. The antibodies may also be used as blocking polypeptides, as 

25 outlined above, since they will specifically bind to the bladder cancer protein. 

Methods of identifying variant bladder cancer-associated sequences 

Without being bound by theory, expression of various bladder cancer sequences is 
correlated with bladder cancer. Accordingly, disorders based on mutant or variant bladder 
30 cancer genes may be determined. In one embodiment, the invention provides methods for 
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identifying cells containing variant bladder cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous bladder cancer genes in a cell. This may be 
accomplished using many sequencing techniques. In a preferred embodiment, the invention 
provides methods of identifying the bladder cancer genotype of an individual, e.g., 
5 detennining all or part of the sequence of at least one bladder cancer gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced bladder cancer gene to a known bladder cancer 
gene, e.g., a wild-type gene. 

1 0 The sequence of all or part of the bladder cancer gene can then be compared to the 

sequence of a known bladder cancer gene to determine if differences exist. This can be done 
using many known homology programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the bladder cancer gene of the patient and 
the known bladder cancer gene correlates with a disease state or a propensity for a disease 

1 5 state, as outlined herein. 

In a preferred embodiment, the bladder cancer genes are used as probes to determine 
the number of copies of the bladder cancer gene in the genome. 

In another preferred embodiment, the bladder cancer genes are used as probes to 
determine the chromosomal localization of the bladder cancer genes. Information such as 

20 chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the bladder 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

25 In one embodiment, a therapeutically effective dose of a bladder cancer protein or 

modulator thereof, is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques. See, e.g., Ansel, et al. (1999) Pharmaceutical Dosage Forms and Drug 

30 Delivery Lippincott; Lieberman (1992) Pharmaceutical Dosage Forms (vols. 1-3) Dekker, 
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ISBN 0824770846, 082476918X, 0824712692, 0824716981; Lloyd (1999) The Art, Science 
and Technoloev of Pharmaceutical Compounding Amer. Pharma, Assn.; and Pickar (1999) 
Dosage Calculations Thomson. As is known in the art, adjustments for bladder cancer 
degradation, systemic versus localized delivery, and rate of new protease synthesis, as well as 
5 the age, body weight, general health, sex, diet, time of administration, drug interaction and 
the severity of the condition may be necessary, and will be ascertainable with routine 
experimentation by those skilled in the art. USSN 09/687,576, further discloses the use of 
compositions and methods of diagnosis and treatment in bladder cancer and is hereby 
expressly incorporated by reference. 

1 0 A "patient" for the purposes of the present invention includes both humans and other 

animals, particularly mammals. Thus the methods are applicable to both human therapy and 
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the bladder cancer proteins and modulators thereof of the 

1 5 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapuhnonary, vaginally, rectally, or intraocularly. In 
some instances, e.g., in the treatment of wounds and inflammation, the bladder cancer 
proteins and modulators may be directly applied as a solution or spray. 

20 The pharmaceutical compositions of the present invention comprise a bladder cancer 

protein in a form suitable for administi-ation to a patient, hi the preferred embodiment, the 
pharmaceutical compositions are in a water soluble form, such as being present as 
pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 

25 biological effectiveness of the free bases and that are not biologically or otherwise 

undesirable, formed with inorganic acids such as hydrochloric. acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycohc acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fimiaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandehc acid, 

30 methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
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"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from phannaceutically 
5 acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occuning substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 

10 carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
lactose, com and other starches; binding agents; sweeteners and other flavoring agents; 
coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of adminisfration. For example, unit dosage forms 

1 5 suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that bladder cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 

20 packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a bladder cancer 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 

25 solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodiimi chloride, potassium chloride, calcium chloride, 

30 sodium lactate and the like. The concentration of active agent in these formulations can vary 
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widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 
the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Hardman and Limbird 
(eds. 2001) Goodman and Oilman: The Pharmacologial Basis ofTheraueutics McGraw-Hill. 
5 Thus, a typical pharmaceutical composition for intravenous administration would be 

about 0. 1-10 mg per patient per day. Dosages from about 0. 1-100 mg per patient per day 
may be used, particularly when the drug is administered to a secluded site and not into the 
blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 

1 0 administrable compositions will be known or apparent to those skilled in the art, e.g., 

Remington's Pharmaceutical Science and Goodman and Gihnan: The Pharmacologial Basis 
of Therapeutics, supra. 

The compositions containing modulators of bladder cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 

1 5 compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple adminisfrations of the compositions may be administered 

20 depending on the dosage and frequency as required and tolerated by the patient. The 

composition should provide a sufficient quantity of the agents of this invention to effectively 
treat the patient. An amount of modulator that is capable of preventing or slowing the 
development of cancer in a mammal is referred to as a "prophylactically effective dose." The 
particular dose required for a prophylactic treatment will depend upon the medical condition 

25 and history of the mammal, the particular cancer being prevented, as well as other factors 
such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer based, at least in part, upon gene expression profiles. Vaccine strategies 

3 0 may be used, in either a DNA vaccine form, or protein vaccine. 
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It will be appreciated that the present bladder cancer protein-modulating compounds 
can be administered alone or in combination with additional bladder cancer modulating 
compounds or with other therapeutic agent, e.g., other anti-cancer agents or treatments. 
In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
5 comprising nucleic acid sequences set fortli in Tables lA-13, such as antisense 

polynucleotides or ribozynies, will be introduced into cells, in vitro or in vivo. The present 
invention provides methods, reagents, vectors, and cells useful for expression of bladder 
cancer-associated polypeptides and nucleic acids using in vitro (cell- free), ex vivo, or in vivo 
(cell or organism-based) recombinant expression systems. 

1 0 The particular procedure used to introduce the nucleic acids into a host cell for 

expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors, and other methods for introducing cloned genomic DNA, 

15 cDNA, synthetic DNA or other foreign genetic material into a host cell. See, e.g., Berger and 
Kimmel (1987) Guide to Molecular Cloning Techniques from Methods in Enzymology (vol. 
152) Academic Press; Ausubel, et al. (eds. 1999 and si5)plements) Current Protocols in 
Molecular Biology Lippincott; and Sambrook, et al. (1989) Molecular Cloning: A Laboratory 



as therapeutic agents, and can be foraiulated as outlined above. Similarly, bladder cancer 
genes (including both the fUU-length sequence, partial sequences, or regulatory sequences of 
the bladder cancer coding regions) can be administered in a gene therapy application. These 
bladder cancer genes can include antisense applications, either as gene therapy (e.g., for 
25 incorporation into the genome) or as antisense compositions, as will be appreciated by those 
in the art. 

Bladder cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses.. Such vaccine compositions 
can include, e.g., lipidated peptides (Vitiello, et al. (1995) J. Clin, hivest. 95:341-349); 
30 peptide compositions encapsulated in poly(DL-lactide-co-glycoHde) ("PLG") microspheres 



20 
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(Eldridge, et al. (1991) Molec. Imniunol. 28:287-294; Alonso, et al. (1994) Vaccine 12:299- 
306; Jones, et al. (1995) Vaccine 13:675-681); peptide compositions contained in immune 
stimulating complexes (ISCOMS) (Takahashi, et al. (1990) Nature 344:873-875; Hu, et al. 
(1998) Clin. Exp. Immunol. 113:235-243); multiple antigen peptide systems (MAPs) (Tarn 
5 (1988) Proc. Nat'l Acad. Sci. USA 85:5409-5413; Tarn (1996) J. Immunol. Methods 196:17- 
32); peptides fommlated as multivalent peptides; peptides for use in ballistic delivery 
systems, typically crystallized peptides, viral delivery vectors (Perkus, et al. in Kaufinann 
(ed. 1996) Concepts in Vaccine Development de Gruyter; Chakrabarti, et al. (1986) Nature 
320:535-537; Hu, et al. (1986) Nature 320:537-547; Kieny, et al. (1986) AIDS 

10 Bio/Technology 4:790; Top, et al. (1971) J. Meet. Dis. 124:148-154; Chanda, et al. (1990) 
Virology 175:535-547), paiticles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) J, 
Immunol. Methods 192:25-35; Eldridge, et al. (1993) Sem. Hematol. 30:16-24; Falo, et al. 
(1995) Nature Med. 7:649-653), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 
4:369-388; Gupta, et al. (1993) Vaccine 11:293-306), liposomes (Reddy, et al. (1992) L 

15 Immunol. 148:1585-1589; Rock (1996) Immunol. Today 17:131-137), or, naked or particle 
absorbed cDNA (Ulmer, et al. (1993) Science 259:1745-1749; Robinson, et al. (1993) 
Vaccine 1 1 :957-960; Shiver, et al. in Kaufinann (ed. 1996) Concepts in Vaccine 
Development de Gruyter; Cease and Berzofsky (1994) Arniu. Rev. Immunol. 12:923-989; 
and Eldridge, et al. (1993) Sem. Hematol. 30:16-24). Toxin-targeted delivery technologies, 

20 also Icnown as receptor mediated targeting, such as tliose of Avant Lnmimotherapeutics, Inc., 
Needham, MA, may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 
mineral oil, and a stimulator of inmiune responses, such as lipid A, Bortadella pertussis or 

25 Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 

30 tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
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polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleiikin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 
5 encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et al. (1990) Science 247:1465-1468 as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

10 cationic lipid complexes, and paiticle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include 
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 

15 vaccinia virus, e.g., as a vector to express nucleotide sequences that encode bladder cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the inraiunogenic peptide, and thereby ehcits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
Patent No. 4,722,848. Another vector is BCG (Bacille Cahnette Guerin). BCG vectors are 

20 described in Stover, et al. (1 99 1) Nature 351 :456-460. A wide variety of other vectors useful 
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus 
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the 
Hke. See, e.g., Shata, et al. (2000) Mol Med Today 6:66-71; Shedlock, et al. (2000) I 
Leukoc. Biol. 68:793-806; Hipp, et al. (2000) In Vivo 14:571-85. 

25 Methods for the use of genes as DNA vaccines are well loiown, and include placing a 

bladder cancer gene or portion of a bladder cancer gene under the control of a regulatable 
promoter or a tissue-specific promoter for expression in a bladder cancer patient. The 
bladder cancer gene used for DNA vaccines can encode full-length bladder cancer proteins, 
but more preferably encodes portions of the bladder cancer proteins including peptides 

30 derived from the bladder cancer protein. In one embodiment, a patient is immunized with a 
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DNA vaccine comprising a plurality of nucleotide sequences derived from a bladder cancer 
gene. For example, bladder cancer-associated genes or sequence encoding subfragments of a 
bladder cancer protein are introduced into expression vectors and tested for their 
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
5 responses. This procedure provides for production of cytotoxic T cell responses against cells 
which present antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the bladder cancer polypeptide encoded by the DNA vaccine. 

10 Additional or alternative adjuvants are available. 

In another preferred embodiment bladder cancer genes find use in generating animal 
models of bladder cancer. When the bladder cancer gene identified is repressed or 
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
to the bladder cancer gene will also diminish or repress expression of the gene. Animal 

1 5 models of bladder cancer find use in screening for modulators of a bladder cancer-associated 
sequence or modulators of bladder cancer. Similarly, transgenic animal technology including 
gene knockout technology, e.g., as a result of homologous recombination with an appropriate 
gene targeting vector, will result in the absence or increased expression of the bladder cancer 
protein. When desired, tissue-specific expression or knockout of the bladder cancer protein 

20 may be necessary. 

It is also possible that the bladder cancer protein is overexpressed in bladder cancer. 
As such, transgenic animals can be generated that overexpress the bladder cancer protein. 
Depending on the desired expression level, promoters of various strengths can be employed 
to express the transgene. Also, the number of copies of the integrated transgene can be 

25 determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of bladder cancer and are 
additionally useful in screening for modulators to treat bladder cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 
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For use in diagnostic, research, and therapeutic applications suggested above, kits are 
also provided by the invention. In the diagnostic and research applications such kits may 
include one or more of the following: assay reagents, buffers, bladder cancer-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense or inhibitory 
5 polynucleotides, ribozymes, dominant negative bladder cancer polypeptides or 

polynucleotides, small molecules inhibitors of bladder cancer-associated sequences etc. A 
therapeutic product may include sterile saline or another pharmaceutically acceptable 
emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions (e.g., 

10 protocols) for the practice of the methods of this invention. "While the instructional materials 
typically comprise written or printed materials they are not limited to such. A medium 
capable of storing such instructions and communicating them to an end user is contemplated 
by this invention. Such media include, but are not limited to electronic storage media (e.g., 
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such 

1 5 media may include addresses to internet sites that provide such instructional materials. 

Tlie present invention also provides for kits for screening for modulators of bladder 
cancer-associated sequences. Such kits can be prepared firom readily available materials and 
reagents. For example, such kits can comprise one or more of the following materials: a 
bladder cancer-associated polypeptide or polynucleotide, control positive or negative 

20 samples, reaction tubes, and instructions for testing bladder cancer-associated activity. 

Optionally, the kit contains biologically active bladder cancer protein. A wide variety of kits 
and components can be prepared according to the present invention, depending upon the 
intended user of the kit and the particular needs of the user. Diagnosis would typically 
involve evaluation of a plurality of genes or products. The genes will be selected based on 

25 correlations with important parameters in disease which may be identified in historical or 
outcome data. 
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EXAMPLES 

Example 1 : Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and analyzed 
using gene chips. RNA was isolated and gene chip analysis was performed as described 
5 (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981-993). 



TABLE DESCRIPTIONS 
Table lA shows about 3413 that exhibit increased or decreased expression in bladder cancer 
10 samples. See USSN 60/302,814. 

Table 2A shows about 485 genes overexpressed in bladder tumors relative to normal tissues 
as analyzed using the Afiymetrix/Eos Hu03 GeneChip array. See USSN 60/343,705. 

15 Table 3 A shows about 414 genes upregulated in bladder cancer relative to normal body 
tissues and preferred for utility as small molecule, antibody, DNA vaccine targets for the 
therapy of bladder cancer. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from 
this analysis was expressed as average intensity (Al), a normalized value reflecting the 

20 relative level of mUNA expression. 

Table 4A shows about 129 genes upregulated in bladder cancer relative to normal body 
tissues and preferred for utility as diagnostics of bladder cancer. These genes were selected 
from 59680 probesets on the Eos/Affymetrix HuOS Genechip array. Gene expression data for 
25 each probeset obtained from this analysis was expressed as average intensity (Al), a 
normalized value reflecting the relative level of mRNA expression. 

Table 5A shows about 149 genes upregulated in bladder cancer relative to normal body 
tissues. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 
30 Genechip array. Gene expression data for each probeset obtained from tlais analj^is was 

expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA 
expression. 

Table 6A shows about 199 genes upregulated in bladder cancer relative to normal bladder 
35 tissue. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 
Genechip array. Gene expression data for each probeset obtained from this analysis was 
expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA 
expression. 

40 Table 7A shows about 63 genes downregulated in bladder tumors relative to normal bladder. 
These genes were selected from 59680 probesets on the Eos/Affymetrix HuOS Genechip 
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array. Gene expression data for each probeset obtained from this analysis was expressed as 
average intensity (AI), a normalized value reflecting the relative level of mRNA expression. 

Table 8 A shows about 1440 genes upregulated in Ta or Tl bladder tumors from patients who 
5 later presented with muscle-invasive bladder tumors (stage T2-T4). Gene expression data for 
each probeset obtained from this analysis was expressed as average intensity (AI), a 
normalized value reflecting the relative level of mRNA expression. 

Table 9A shows about 1200 genes upregulated in Ta or Tl tumors of patients who later 
10 presented with either more Ta tumors or no tumors at all. Gene expression data for each 
probeset obtained from this analysis was expressed as average intensity (AI), a normaKzed 
value reflecting the relative level of mRNA expression. 

Table lOA shows about 65 genes upregulated in non-invasive exophytic Ta bladder tumors 
15 relative to 'T2-T4 muscle-invasive tumors. Gene expression data for each probeset obtained 
from this analysis was expressed as average intensity (AI), a normalized value reflecting the 
relative level of mRNA expression. 

Table 1 lA shows about 106 genes upregulated in muscle-invasive T2-T4 bladder tumors 
20 relative to non-invasive exophytic Ta bladder tumors. Gene expression data for each 

probeset obtained from this analysis was e3q)ressed as average intensity (AT), a normalized 
value reflecting the relative level of mRNA expression. 

Table 12A shows the Pkey, ExAccn, UnigenelD, and Unigene Title for all of the sequences 
25 in Table 13. Seq ID No. is used to link Table 12A to table 13. 

Tables 1B-12B show the accession numbers for those Pkey's lacking UnigenelD's for tables 
1 A -12A, respectively. For each probeset is listed a gene cluster number from which 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
30 Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland CaUfomia). Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 

35 Tables 1C-12C show genomic positioning for Pkey's lacking Unigene ZD's and accession 
numbers for tables 1 A -12A, respectively. For each predicted exon, is listed genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 
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early sl^e If target is an earl stage {Ta) bladder tumor marker or 
late stage if target is a late stage (T2-T4) bladder tumor marker or 
T2-T4 grade 3 papilloma marker or T2-T4 grade 3 solid tumor marker or Upregulate stage 
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445850 A1262049 

446406 A1553581 

446500 U78093 

447595 AW379130 

447918 AI129320 

448076 AJ133123 

448283 AI340452 

448303 BE622468 



448429 D17408 

448666 AI636697 

448901 AK001021 

448999 AF1 79274 

449226 AB002365 

449238 AA428229 

449422 AA001373 

449690 AA002140 

449874 AA1 35688 

449926 AI342493 

460300 AL041440 

450578 AI971773 

450810 BE207588 

450831 R37974 

451331 AK002039 

461533 NM 004657 

451782 ■ 

451948 AW452473 

462422 AA621416 

452463 R36452 

452776 AA1 94540 

462814 AI092790 

453064 R40334 

453351 A1525721 

463355 AW295374 

463359 AA448787 

453464 AI884911 

463500 A1478427 

453582 AW854339 

453586 AA248089 

453666 AW0156B1 

453698 AA037615 

453702 AA037637 

463725 W28543 



458622 AA972412 



400977 
401024 
401048 
401537 
401619 



oxidalive 3 alpha hydtoxyslerold dehydro 



Hs.25511 
Hs.25255 
Hs.25243 



454471 AW902125 

454637 AW811613 

454750 AW866285 

455073 AW854829 

455485 AA1 02287 

455611 L06419 

456100 AI983981 

456841 AA875863 

467064 AA776743 



sushi-repeal-containing protein, X chrom 



Hs.20196 
Hs.182979 
Hs.11924 
Hs.233401 
Hs.21223 



Hs.85524 
Hs.59a21 
Hs.33024 



muscle-specllfc RING-finger protein homo 



Homo sapiens cDNA FU20767 fis, clone CO 



Hs.27018 Ris 



Hs.55016 
H8.301395 
Hs.61814 
Hs.31412 
Hs.24872 
Hs.32989 
Hs.43125 
Hs.33476 
Hs.50841 



ESTs 
ESTs 

hypothetical protein FLJ21935 

Homo sapiens cDNA: FU21204 lis, done C 

Homo sapiens cDNA: FLJ22750 lis, done K 

Homo sapiens cDNA FU 11 422 fis, clone HE 

ESTs, Wfeakly similar to aortic carlxjuype 

receptor (calcitonin) activity modi^ing 

ESTs 

hypothetical protein FU11937 

ESTs, Wealiiy sMlar to tufteiin |M.musc 

ESTs, Moderately similar to AF107203 1 a 



gb:48c5 Human retina cDNA randomly prime 
eukaryotic translation initiation factor 
secreted modular calcium-binding protein 
gb:QV()-NN1022.120500-220-h12 IMN1022 Homo 
gb:CM3-ST0157-300999-017-f06 ST0157 Homo 
gb:QV4-SN0024080400-167-a09 SN0024 Homo 
gb;QV2-C;T0261-201099-011-f01 CT0261 Homo 
Homo sapiens cDNA: FU20896 lis, clone A 
procollagen-lysine, 2-oxoglutarate 5-dio 



Hs.152345 
Hs.191589 
Hs.1 08479 



Hs.290656 EST 



downregulate stage 
downregulate stage 
downregulate stage 
downregulate stage 



downregulate stage 
downregulate stage 
downregulate stage 
downregulate stage 
downregulate stage 
downregulate stage 



downregulate stage 
downregulate stage 
downregulate stage 
downregulate slage 
downregulate slage 
downregulate slage 
downregulate slage 
downregulate slage 
downregulate slage 
downregulate slage 
downregulate slage 
downregulate stage 
downregulate slage 
downregulate slage 
downregulate stage 
downregulate slage 
downregulate stage 
downregulate stage 
downregulate slage 
downregulate slage 
downregulate stage 
downregulate slage 
downregulate stage 
downregulate stage 
downregulate slage 



0.41 2,45 downreg 



downregulate slage 
downregulate stage 
downregulate stage 
downregulate stage 
early slage 
eariy stage 
early stage 
eaity stage 
eaity stage 
early stage 
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early stage 
early slage 
early stage 



404335 

404345 AA730407 Hs.1 69166 

404501 AWZ47252 H3.75514 

404594 

404874 

404881 

404896 NI^W0429 Hs.106845 
404999 U68196 Hs.296281 



30 

35 
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75 



406881 D16154 

407411 AF060170 

407639 AW205369 

408112 AW451982 

408732 AL117490 

409103 AF251237 

409840 AW502122 

410128 AW904599 

411474 AW848427 

412564 X837D3 

413266 BE3C0352 

413341 H78472 

414056 AW818687 

414170 AA335996 

414220 BE298094 

414276 BE297862 

414327 BE408145 



Hs.191325 

Hs.6366 

Hs.3743 



417371 N74613 



422669 H12402 
422743 BE3Q4578 
422760 BE409561 



427827 AA416577 
428651 AFig6478 
430727 X75917 



Hs.170219 
Hs.193718 
Hs.121269 
Hs.189105 
Hs.1 88401 
Hs.2654 



Hs.252936 
Hs.248613 
Hs.47225 
Hs.1 12208 



gb;ta49g09.x1 Na.CGAP.Lu25 Homo sapiens 
gb'.Humangene for cytochrome P-450c11, e 
gb:Homo sapiens AS1 2 protein mIRNA, parti 
ESTs 



ESTs 

Homo sapiens mPMA; clDNA IDKFZp434N211 (fr 
XAGE-1 protein 

gb:UI.HF-BR0p-£!y-c-08-0-Ul.r1 NIH.IVIGC.5 
gb:RC1-NN1063-260400^ 1-h05 NN1063 Homo 
gb:IL3-CT0214-150200^75-H10 CT0214 Homo 
caidlac ankyrln repeat protein 
gb«00944231F1 NIH_MGCJ7 Hano sapiens o 
ESTs, Weakly similar to cDNA EST yk41 4c9 
Homo sapiens cDNA; FLJ21522 lis, done C 



gb:601113231F1 NIH.MGCJ7H0 
gb:601174780F1 NIH_1VIGCJ7 Horaosapien 
ESTs, Moderately similar to I>IAC-1 prolei 
gb:601075456F1 NIH.MGC_12Honi0 5aplen 
ESTs, Weakly similar to 16.7Kd protein [ 
phosphollpase A2, group HA (platelets, 
ESTs 

gb.ym27li06.r1 Soares infeni brain 1 NIB H 



Hs.293490 ESTs 



gb;on89e04,s1 Soares NFU.T_GBC_S1 Homos 
ESTs 

Kruppel-likefactor2(iung) 

gb:lz49b05.y1 NCLCGAP_Bm52 Homo sapien 

ribosomai protein L3 

gb;601299865F1 NIH_I^GC.21 Homo sapiens c 
glulathione reduclase 
paired mesoderm homeo box 1 
ribose 5-phosphate Isomerase A (rlbose 5 
gb'.601076309F1 NIH_MGC_12 Home sapiens c 
gb:601064837F1NIH_MGCJ0I' 
tiypothelical protein 
ESTs 



UHC binding factor, beta 



early stage 
early stage 
early stage 
early stage 
early stage 

early stage 
early stage 
ear^ stage 

earVsSe 
early stage 
ear^ stage 

early stage 
early stage 
early stage 

early stage 
early stage 
early stage 
early stage 
early stage 
early stage 
early stage 
early stage 
early stage 
early stage 
early stage 
early stage 

early stage 



early stage 

early stage 
early stage 
early stage 
early stage 
early stage 
early stage 
early slage 
early stage 
early stage 
early slage 



0.45 0.66 early stage 

1.45 0.12 early stage 

0.33 0.71 early stage 

0.52 067 early stage 

1 026 early stage 

0.2 0.57 early stage 

0.41 0.64 early stage 

3.75 0.1 early stage 

0.55 0.54 early stage 

1 0.21 early stage 

0.28 0.85 early stage 

0.31 0.69 early stage 

0.11 0.69 earlystage 

1 055 earV stage 

0.33 0.71 earlystage 

1.16 0.41 earlystage 

1.85 024 earlystage 

0.78 0.46 earty stage 
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AI650360 


Hs.100256 












AW97139a 




gb:EST363487 MAGE 'esequences, MAGLHomo 






early stage 


431900 


AW972048 










early stage 




NM_006979 


Hs.278721 












NM 014554 


Hs 278949 










433404 




Hs. 1027 20 












AF090945 




gb:Hcmo sapiens done HQ0670 






early stage 














early stage 




AA223646 


Hs. 57222 


,iun'm (nucjear enyeiope membrane protein 




0.49 




435752 


AF230801 


Hs.125180 


growth tiormone receptor 


0.62 


0.4 


early stage 


436178 


BEl 52396 


Hs.21590 


Horro sapiens HSPC304 mRNA, partial cds 


1.65 


0.14 






AJ227892 


Hs.146274 








early stage 




AI793222 


Hs.1 66817 




o!i7 


1.46 


early stage 






Hs. 2931 30 








early stage 




AW975714 


Hs.129004 










436869 


NkL014867 


Hs.297661 








early stage 


















AA813136 














AW384B1 5 


Hs.149208 


KIAA1555 protein 










AA825976 
















Hs.265499 












AA962623 




ESTs, Wealdy similar to NPT2_HUMAN RENAL 








440397 


AA884448 


Hs.157239 










440509 


BE410132 


Hs.1 34202 


ESTs, Weal<ly similar to B41 1 82 collagsn 






early stage 


440605 
















AW1 35374 


Hs.270949 










443258 




Hsigoga 


sulfate transporter 1 


0.85 


0.49 


early stage 




AW1 36354 


Hs. 145303 


























AI525135 


Hs.210507 








early stage 


















AI560776 


Hs.1 99854 












AF074913 










early stage 




BE616709 




gb:601 279937F1 NIHJi/IGC 39 Homo sapiens c 










AI633836 














BE616861 




gb:601279056F1 NIH_MGC 39 Homo sapiens c 






early stage 




BE410360 




gb:601302340F1 NIH_MGCl21 Homo sapiens c 


0^27 


076 


early stage 








ESTs, Moderately similar to Atij7_HUMAN A 










AI571000 














AW957906 






028 


0'83 


early stage 




A1737406 
H83979 


Hs.207026 


ESTs, Weakly similar to B56205 t-anscrip 
gb:ys93d1 1 .r1 Scares refina N2b5HR Homo 










AI792066 


Hs.283902 


Homo sapiens BAG clone RP1 1-48 1J1 3 from 






earV stage 




AI9D5518 




gb:RC-BT091-210199-098 BT091 Homosapien 










AAQ26735 




gb:ze93dQ5.r1 Soares_fetELhearLNbHH19W 


o!44 




early stage 




AW090524 










ear^ stage 




AW847505 




gb:RC0-CT021O-28099aO21-c10 CT0210 Homo 




o!5 


early stage 




AW813728 




ESTs, Highly similar to AF1 61 358 1 HSPCO 






ear^ stage 




AW835127 




gb:RC4-LT0011-10010ojoi2-o07LT0011 Homo 










AW852286 




gb:QVO-CT0225-100400-187-d08 CT0225 Homo 










AW995589 




gb:QV3-BN0046-150400-151-g09 BN0046 Homo 










AI733857 




























AJ000098 














AA782222 


Hs.1 92008 












AF035528 














A1081687 




























A1286100 




























A1299739 




Homo sapiens cDNA FLJ1 2553 fis, clone MT 






















458767 


797083^ ^ 


Hs.1 48355 


ESTs 


1 


0^17 


early stage 


459399 


BE407712 




gb;601299745F1 NIH.MGC_21 Homo sapiens c 


0.68 


0.56 


early stage 


400860 








0.08 




408190 


ABQ32963 


Hs.43677 


ATPase, Class 1. type 8B, member 2 


0^58 


0.84 




408558 


AW015759 




ESTs 


1.26 


0.45 


late stage 


410077 


AF097645 


Hs!58570 


deleted In cancer 1; RNA helicase HDB/Dl 


6.2 


0.12 


late stage 


410295 


AA741357 


Hs.62041 


nidogen (enactin) 




0.86 




410310 


J02931 


Hs.62192 


coagulation factor III (thromboplastin, 


145 


0.27 




410614 


A1091195 




growth arrest-speciTic 1 




1.12 


late stage 


410867 


X63556 


HsJ50 


{brillini (Marfan syndrome) 


0.71 


1.07 




411573 


AB029000 


Hs.70823 


K1AA1077 protein 


3.64 


0.19 




412116 


AW4021S6 


Hs.784 


Epsteln-Barr virus induced gene 2 (lymph 


5.18 


0.13 


late stage 


412178 


AW89S526 




gb:RC6-NN0072.040500-011-E05 NN0O72Homo 


7.55 


0.08 




412429 


AV650262 


Hs.75765 


GR02 oncogene 


3.37 


0.15 


latestege 


412652 


AI801777 


Hs.6774 


ESTs 


0.49 
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AL1 33396 


Hs.74621 


prion protein (p27-30) (Creulzfeld-Jaliob 










WM_002984 














AW957446 


Hs. 301711 












W62194 


Hs.75929 


























AW069181 




ESTs, Wealdy similar to transfbrmation-r 
























AA1 60363 
































gb:HUM526E07B Clontocti human placenta po 






































AW903838 


Hs.81800 












AL048678 


Hs. 82503 


























AW291587 


















collagen, lype av, aipna i 












Hs.83942 














Hs.233459 












NM_006144 














AA1 59850 




lipoma HMGIC fusion partner 










AI581085 


Hs!24678 












Ai718702 


















retinol-ljinding protein 1, cellular 










BE281591 


Hs.106768 


liypothellcal protein FU 10511 
























AW875237 


























































Hs. 23388 


Homo sapiens cDNA: FtJ21310 Its, clone C 










AW679960 


H3.135150 












AF039018 










































AA740632 


Hs 120850 












BE242284 














AB002298 










































AI63404B 




























AB007863 




KIAA0403 protein 










A108433B 


















ESTs, Weakly similar to alternatively sp 














hexabrachion (lenascin C, cytotacHn) 
























AAB73350 




gb:oh64h02.s1 NCI CGAP Kld5 Homo sapiens 










AL042613 














AA533447 


Hs .2851 73 














Hs.274363 


Homo sapiens mRNA: cDNA DKF2p58611524(f 












Hs]257631 












AW960564 


















gb:yd7dd06.s1 Scares fetal liver spleen 










AW246900 




hypothetical protein 










BEB21807 














Al 302471 


Hs. 124292 


Homo sapiens cDlilA?FLJ23123 fis, clone L 
























AW021103 










































AA428403 












442739 


NM 007274 


Hs!8679 


cylosolic acyl coenzyme A tlibesler hydr 


7^95 


0.06 


late stage 


443852 


AI679966 


HS.150S03 


ESTs 




0.12 


late stage 


443896 


AI680242 


Hs.271687 


Homo sapiens cDNA FU13527 fis, clone PL 


7!95 


0.08 


late stage 


444212 


AW503976 


Hs.10649 


basement membrane-induced gene 


2.31 


0.28 


late stage 


444331 




Hs.24144 


ESTs 


0.32 






445142 


AW978484 


Hs.93842 


Homo sapiens cDNA; FU22654 fis, clone H 


2.52 


0^24 


late stage 


445701 


AF055581 




lymphocyte adaptor pnjtein 


1.43 


047 




446564 


U53446 


Hs!l5432 


downregulated in ovarian oanoar 1 


0.54 


1.39 




447526 


AL048753 


Hs.340 


small inducible cytol<he A2 (monocyte ch 


1.43 


0.43 


late stage 




AA313230 


Hs.19413 


S100 calcium-Undlng protein A12 (oalgra 


1.35 


0.26 


late stage 


447818 


W79940 


Hs.21906 


ESTs 


6.53 


0.13 


late stage 


449567 


AI990790 


HS.1BB614 


ESTs 


4.7 


0.13 


late stage 


460455 


AL117424 


HS.2503S 


chloride Intracellular channel 4 








452239 


AW37937B 


Hs.170121 


protein tyrosine phosphatase, receptor t 


o!59 


118 




452698 


Nl»i001295 


Hs.301921 


ESTs 


2.31 


a26 


latestage 
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453212 


H15416 


Hs,21865 


ESTs 


2.51 


0.26 


late stage 


455510 


AA422029 


Hs.143640 


ESTs, Weakly similar to hyperpdarizaUo 


8.6 


0.06 




400775 


R58624 


Hs.2186 


eukaryotic iranslalion elongation Mr 






T2-T4 grade 3 papilloma mart<er 


401508 










T2-T4 grade 3 papilloma marker 


403092 
404232 












T2-T4 grade 3 papilloma marker 
T2-T4 grade 3 papilloma marker 


407020 


U49973 




gb;HunianTlgger1 transposable element, c 






T2-T4 grade 3 papilloma marker 


407345 


A1053B36 


Hs.169355 


ESTs, Weakly similar to AUI.HUMAN ALU S 






T2-T4 grade 3 papilloma marker 


407420 


AF084362 




gbiHocno sapiens llpoate-protein ligase B 






T2-T4 grade 3 papilloma marker 




AW131324 


Hs.246759 


ESTs, Weakly slmHar to KIAA1074 protein 






T2-T4 grade 3 papilloma marker 


407666 






mo [mothers against decapentaplegte, Dr 






T2-T4 grade 3 papilloma marker 


407916 


L09234 


H3.603 


ATPase, H-i- transporting, lysosomal (vacu 






T2-T4 grade 3 papilloma marker 


407938 


AW118147 




ESTs 






T2-T4 grade 3 papilloma marker 


4081 88 


AW168847 




ESTs 






T2-T4 grade 3 papilloma marker 


408950 


AA707814 




ESTs 






T2-T4 grade 3 papilloma marker 


409033 


T97490 


Hs.50002 


small inducible cytokine subfamily A (Cy 


1.2 




T2-T4 grade 3 papilloma marker 


409045 


AA635052 


Hs.50094 


Homo sapiens mRNA; cDNA DKFZp434O0515 (f 






T2-T4 grade 3 papilloma marker 


409196 


NM_001874 


Hs.169765 


carboxypeptldase M 






T2-T4 grade 3 papilloma marker 




AAC69998 




gb'.zm67b03.r1 Stratagene neuinepithelium 






T2-T4 grade 3 papilloma marker 


410010 


AW572853 


Hs.257533 


ESTs, Weakly similar to ALUS.HUUMN ALU S 




0.6 


T2-T4 grade 3 papilloma marker 


410157 


AW593277 


Hs.225056 


ESTs 






T2-T4 grade 3 papilloma marker 


411112 


AW318158 




gb:CI(11-ST0277-161299-070-g07 ST0277 Homo 






T2-T4 grade 3 papilloma marker 


411336 


AW337675 




gb:QV2-LT0039-260300-107-b04 LT0039 Homo 






T2-T4 grade 3 papilloma marker 


412051 


T15B72 


HS.26B713 


ESTs, Wtekly similar to hypotlielical pro 






T2-T4 grade 3 papilloma mari<er 


413485 


N52628 




gb:yv37g1U1 Scares fetal liver spleen 






T2-T4 grade 3 papilloma marker 


413574 


BE149153 


Hs.129998 


Homo sapiens cDNA FU14267 Us, otone PL 






T2-T4 grade 3 papilloma marker 


413782 


BE546104 




gb:601072642F1 NIH_MGC_12 Homo sapiens c 






T2-T4 grade 3 papilkima maAisr 




Hg4622 


Hs.ig3358 


ESTs, Moderately similar to diabetes mel 






T2-T4 grade 3 papilloma marker 


415293 


R49462 


Hs.106541 


ESTs 






T2-T4 grade 3 papilloma marker 


415442 


F12963 


Hs,7045 


GL004 protein 






T2-T4 grade 3 papilloma marker 


415255 


T87587 


Ks.272082 


ESTs 






T2-T4 grade 3 papilloma marker 


417047 


AA192640 


Hs.1526 


ATPase, Ca-n- transporting, cardiac muscl 






T2-T4 grade 3 papilloma marker 


417131 


LI 01 23 


Hs.1071 


surfactant protein A binding protein 






T2-T4 grade 3 papilloma marker 


417367 


N73877 


Hs.171315 


ESTs 






T2-T4 grade 3 papilloma marker 


419721 


NM_001650 
AA808259 


Hs.288650 


aquaporin 4 






T2-T4 grade 3 papilloma marker 


420294 


Hs.196716 


ESTs 




0.65 


T2-T4 grade 3 papilloma marker 


423539 


AA328082 




ESTs, Weakly similar to llirombospondln t 






T2-T4 grade 3 papilloma marker 


424549 


AI8732C5 




Homo sapiens cWA FU14236 fis, done NT 






T2-T4 grade 3 papilloma marker 


425458 


H89317 


Hs!l82889 


ESTs 






T2-T4 grade 3 papilloma marker 


425475 


AL134728 




gb:DKFZp547A1890_r1 547 (synonym; hfbri) 






T2-T4 grade 3 paplltoma marker 


429453 


AA453195 


Hs.124222 


ESTs 






T2-T4 grade 3 papilloma marker 


431200 


AFC44923 


Hs.250752 


hookl protein 






T2-T4 grade 3 papilloma marker 


431938 


AA938471 


Hs.115242 


developmenlally regulated GTP-bhding pr 






T2-T4 grade 3 papilloma marker 


431944 


A1360891 




ESTs 






T2-T4 grade 3 papilloma marker 


432021 


AA524470 


Hs.58753 


ESTs 






T2-T4 grade 3 papilloma marker 


432205 


AI806583 


Hs.125291 


ESTs 




0.31 


T2-T4 grade 3 papilloma marker 


432527 


AW975028 


Hs.102754 


ESTs 






T2-T4 grade 3 papilloma marker 


434069 


AF116651 


Hs.283058 


hypothetteal protein Pi«)08C0 




0.41 


T2-T4 grade 3 papilloma marker 


435273 


AW994242 


H8.173495 


ESTs 






T2-T4 grade 3 papilloma marker 


435965 


AI034368 


Hs.132650 


ESTs 




0.36 


T2-T4 grade 3 papilloma marker 


438227 


AA706937 


Hs.120802 


ESTs, li/loderately similar lo A26641 Na+/K 






T2-T4 grade 3 papilloma marker 


436635 


AW104325 


Hs.272093 


ESTs, Weakly similar to STK2 HUMAIM SERIN 
ESTs 






T2-T4 grade 3 papilloma marker 


436640 


AA724411 


Hs.156065 






T2-T4 grade 3 papilloma marker 


43S884 


BE046857 




gb:tin42e02.x1 NCLCGAP_i^F2 Homo sapiens 






T2-T4 grade 3 papilloma marker 


437251 


AW976082 




gb:EST388191 MAGE resequences, MAGN Homo 
ESTs 






T2-T4 grade 3 papilloma marker 


437348 


AA749149 


Hs.163114 






T2-T4 grade 3 papilloma marker 


437769 


AA767B53 


Hs.122895 


ESTs 






T2-T4 grade 3 papilloma marker 


437771 


AA811071 


Hs.123349 


ESTs 






T2-T4 grade 3 paplltoma marker 


438347 


AA909686 


Hs.293397 


ESTs 






T2-T4 grade 3 papilloma marker 


439171 


AA831133 


Hs.294128 


ESTs 




0.95 


T2-T4 grade 3 papilloma marker 


439914 


AA854056 


Hs.145394 


ESTs 






T2-T4 grade 3 papilloma marker 


440399 


AI215527 


Hs.125589 


ESTs 






T2-T4 grade 3 papilloma marffflr 


440972 


BE044588 


Hs,276158 


ESTs 


1 


1 


T2-T4 grade 3 papilloma marker 


442004 


AA973558 


Hs.128317 










442270 


BE565399 


Hs.62005 


ESTs 






T2-T4 grade 3 papilloma marker 


443413 


AI056467 


Hs 221642 


ESTs 






T2-T4 grade 3 papilloma marker 


443927 


AW016728 


Hs.134860 


ESTs 






T2-T4 grade 3 papilloma marker 


445442 


N20392 


Hs.42846 


ESTs 






T2-T4 grade 3 papilloma marker 


445511 


AW418497 


Hs.145583 


ESTs 






T2-T4 grade 3 papilloma marker 


445883 


AF070564 


Hs.13415 


Homo sapiens clone 24571 mRNA sequence 






T2-T4 grade 3 papilloma marker 


446552 




Hs.156241 


ESTs 






T2-T4 grade 3 papilloma marker 


447399 


A1815'101 


Hs.251967 


Homo sapiens cbne 785627 unknown mRNA 


2.9 


0.14 


T2-T4 grade 3 papilloma marker 


449111 


T83109 


Hs.196130 


ESTs 






T2-T4 grade 3 papilloma marker 


449232 


AW192780 


Hs.196080 


ESTs 




0.8 


T2-T4 grade 3 papilloma marker 


451373 
452453 


AI792030 




gb-.os03e1 1.y5 NCI_CGAP_Lu5 Homo sapiens 






T2-T4 grade 3 papilloma marker 


AI902519 




gb:QV-BT009-101198^51 BT009 Homo s^len 






T2-T4 grade 3 papilloma maiter 


452534 


AW083022 


Hs.149425 


Homosaplens cDIMA FU11980 ts, done HE 




0.67 


T2-T4 grade 3 papilloma maiter 



101 



wo U3/003906 



PCT/US02/21338 



452536 


BE063380 




gb:PMO-BT0275-291099^02-g10 BT0275 Homo 


1.65 


0.26 


T2-T4 grade 3 papilloma marker 




M027115 


Hs.100205 


ESTs, Weakly similar to AAAD_HUMAN ARYLA 






T2-T4 grade 3 papilloma marker 














T2-T4 grade 3 papilloma marker 




NM 007197 










T2-T4 grade 3 papilloma marker 




AL037925 




gb-.DKFZp564M037_r1 564 (synonyni: hfbt2) 






T2-T4 grade 3 papilloma marker 


453609 












T2-T4 grade 3 papilloma marker 




AL079389 




gb:DKFZp434E2116_r1 434 (synonym: htes3) 






T2-T4 grade 3 papilloma marker 




R41806 












455880 


AW880861 
BE153208 




gb;QVO-DT0033fl70300-152-c12 DT0033 Homo 
gb:PMO-HT0335-050400-007-F10 HT0335 Homo 






T2-T4 grade 3 papilloma marker 




AW835416 


Hs.29417 


HCF-binding traiscription factor Zliangfe 






T2-T4 grade 3 papilloma marker 














T2-T4 grade 3 papilloma marker 


456912 


msm 


Hs!l58112 


protein tyrosine phosplialase, receptorl 


] 




T2-T4 grade 3 papilloma marker 




AA761820 
AW9&7813 
AW971949 


Hs.250955 
HS.201DS4 
Hs^1252 








T2-T4 grade 3 papilloma marker 




AA492071 




gb:ne97l)C4.s1 NCLCGAP_Kid1 Homosaplens 






T2-T4 grade 3 papilloma martter 




AW300248 


Hs.181693 








T2-T4 grade 3 papilloma marker 




AF086561 










T2'T4 grade 3 papilloma mariner 






Hs.181801 










459396 


AI907536 


Hs!l03B69 


ESTs 


1 


1 


T2-T4 grade 3 papilloma marlcer 


401002 








































T2-T4 grade 3 solid tumor marker 


403776 








1 


\3 


T2-T4 grade 3 solid tumor marker 




























T2-T4 grade 3 solid tumor marker 




AL390179 




Homo sapiens mRNA; cDNA DKFZp547P134 (fr 






T2-T4 grade 3 solid tumor marker 


406471 












T2-T4 grade 3 solid tumor marker 






Hs 220529 












AW157431 


Hs.248941 








T2-T4 grade 3 solid tumor marker 




W03754 


Hs.5081 3 










409464 






zinc Snger protein 37a (KOX 21) 






T2-T4 grade 3 solid tumor marker 




AA1 25985 
BE220489 


Hs.1 13592 








T2-T4 grade 3 solid tumor marker 




AW770768 


Hs.266717 












AW866330 




gb.'QV4-SN0024-OB0400-167-e01 SN0024Homo 






T2-T4 grade 3 solid tumor marker 




AW936369 


Hs.69165 


gb;QV4-DT0021-3O1239-O71-(IO7 DT0021 Homo 






T2-T4 grade 3 solid tumor marker 
T2-T4 grade 3 solid tumor marker 




A10S5016 










T2-T4 grade 3 solid tumor marker 




BE242870 




solute carrier family 1 (glial high affi 










AA130158 


Hs.19977 


ESTs, Moderately similar to ALUB.HUMAN A 






T2-T4 grade 3 solid tumor marker 




D31010 




gb:HlJI\lL12147 Human fetal lung Homosapie 






T2-T4 grade 3 solid tumor marker 






HS.22S34 


ESTs 






T2-T4 grade 3 solid tumor marker 




AW968613 




BCL2Jadenovirus E1B 19k[}-interacling pro 






T2-T4 grade 3 solid tumor madcer 




AW247430 










T2-T4 grade 3 solid tumor mariter 




AL033539 




Human Dl^ sequence from clone RP1-309H15 






T2'T4 grade 3 solid tumor marker 




AW873803 


Hs.1 02876 






T2-T4 grade 3 solid tumor marker 


422660 


AW297582 






1.05 


o!32 


T2-T4 grade 3 solid tumor marker 




AA31 8334 




gb:EST20402 Retina II Homo sapiens cDNA 






T2-T4 grade 3 solid tumor marker 






Hs.1 45404 








T2>T4 grade 3 solid tumor marker 




AJ005273 


Hs.1 23647 








T2-T4 grade 3 solid tumor marker 




AW959908 
















Hs.1 44339 












AL037915 


Hs.1 55097 








T2-T4 grade 3 sofid tumor madder 




























T2-T4 grade 3 solid tumor marker 






Hs 249230 


















































435505 


AF200492 


HS711238 


inter1eukln-1 homolog 1 


1.8 


019 


T2-T4 grade 3 solid tumor marker 


436281 


AW411194 


Hs.1 20051 


ESTs 


2.06 


0.14 


T2-T4 grade 3 solid tumor marker 


437010 


AA741368 


Hs.291434 


ESTs 






T2-T4 grade 3 solid tumor marker 


437814 


A1088192 




ESTs, Weakly similar to DDX9_HUMANATP-D 




0.13 


T2-T4 grade 3 solid tumor marker 


438361 


AA805666 


Hs.1 4521 7 


Homo sapiens cDNA: FU23077 lis, clone L 






T2-T4 grade 3 solid tumor marker 


438376 


BE541211 


Hs.34804 


Homo sapiens dm FU11472 lis, clone HE 




0'57 


T2-T4 grade 3 solid tumor marker 


439370 


AW274359 


Hs.1 58353 


ESTs 






T2-T4 grade 3 solid tumor marker 


440021 


AW025498 


Hs.270842 


ESTs, Weaklyslmllarto ALU8_HUMAN ALU S 




0.66 


T2-T4 grade 3 solid tumor marker 


440404 


AI015881 


Hs.125616 


ESTs 




0.26 


T2-T4 grade 3 solid tumor marker 


441523 


AW514263 


Hs.1 68872 


ESTs, weakly similar to ALUF.HUMAN III! 






T2-T4 grade 3 solid tumor marker 


442277 




Hs.202391 


ESTs 


2.4 


0.15 


T2-T4 grade 3 solid tumor mariier 


442738 


AW002370 


Hs.131055 


ESTs 




0.29 


T2-T4 grade 3 solid tumor mariier 


443297 


AI049864 


Hs.1 33029 


ESTs 






T2-T4 grade 3 solid tumor maiter 


444754 


T83911 


Hs.11381 


transmembrane 4 superfamily member 4 




0.61 


T2-T4 grade 3 solid tumor madier 


445550 


AI242754 


Hs.137306 


ESTs 




0.5 


T2-T4 grade 3 solid tumor marker 



102 



wo U3/003906 
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446149 


BE242960 


Hs.203181 


ESTs 




0.25 


T2-T4grade3solidtum( 




446163 


AAC26880 


Hs.25252 


Homo sapiens cDNA FLJ13603 lis, clone PL 




0.21 


T2-T4grade3solidtum( 




446434 


A1823410 




karyopherin alpha 1 (Importin alpha 5) 






T2-T4 grade 3 solid lumc 




446828 


AI694493 


Hs.246916 






0.31 


T2-T4 grade 3 solid tum( 




448591 


A1540111 


Hs,171261 








T2-T4grade3sorrdtumc 




449121 


A1915858 


Hs.194980 


ESTs 






T2-T4grade3solidlum( 








Hs.68446 






0.33 


T2-T4 grade 3 solid tum( 




450451 


AW591528 


Hs.202072 


ESTs 




0.59 


T2-T4 grade 3 solid tumc 




450459 




Hs.281326 


ESTs 




0.43 


T2-T4 grade 3 solid tum( 




450634 


AA872605 


Hs,25333 


Inlerleukin 1 recepto', type 11 




0.05 


T2-T4 grade 3 solid tumc 




451099 


R52795 


Hs,25954 


inlerieukin 13 recepto', alpha 2 


1.55 


0.11 


T2-T4 grade 3 solid tumt 




451106 


BE382701 


Hs,25960 








T2-T4 grade 3 solid tumc 




451130 


Al 762250 


Hs,211347 


ESTs ' ^ 




0.15 


T2-T4 grade 3 solid tumc 




451412 


AW1 36378 


Hs,208060 


ESTs 


1 




T2-T4 grade 3 solid tumc 




451806 


NWL003729 


Hs,27076 


RNA 3'-lerminal phosphate cyclase 


1.35 


0.22 


T2-T4 grade 3 solid tumc 




452114 


N22687 




ESTs 


1 


0.19 


T2-T4grade 3 solid tumc 




452743 


AW9650a2 


Hs,61455 


ESTs 




0.44 


T2-T4 grade 3 solid tumc 


■marker 








gb:HSU70071 Human Homo sapiens cDNAclon 






T2-T4 grade 3 solid tumc 




455235 


AW876951 




gb:CM1-PT0013-131299-067-f09 PT0013 Homo 




0.31 


T2-T4 grade 3 solid tumc 




457792 


AL046988 


Hs,268677 


ESTs, Moderately similar loALU7.HUMANA 




0.24 


T2-T4grade 3 sola tumor 




100147 


D13666 


Hs,136348 


osteoblast specific factor 2 (fasciclin 


20.531 


0.036 


upregulate stage 




101193 


L20861 


Hs.152213 


'wingless-type MMTV integration site lam 




0.526 


upregulate stage 




101724 


U69225 


Hs.620 


bullous pemphigoid anUgen 1 (230/2401(0) 


20.67 


0.037 


upregulate stage 




101309 


M86849 


"H5.323733 


'gap junction protein, beta 2, 26kD (con 


20.78 


0.019 


upregulate stage 




102154 


U17760 


Hs,75517 


"laminin, beta 3 (nicein (125kD), kalini 


Mm 


0.042 


upregulate stage 




102211 


U23070 


Hs.78776 




2.092 


0.28 


upregulate stage 




102523 




Hs,37110 


"melanoma antigen, family A, 9" 




0.306 


upregulate stage 




102915 


X07820 


Hs,2258 


matrix metalioproteinase 10 (slroraelysin 


3.27 


0.041 


upregulate stage 






X54925 


•Hs.83169 


matrix metalioproteinase 1 (interslifal 


13.63 




upregulate stage 




103119 


X63529 


Hs,2a77 


"oadherin 3, type 1, P-cadherin (placeni 


7.296 


o!o54 


upregulate stage 






X82693 


Hs.3185 


"lymphocyte antigen 6 complex, locus CT 


0.908 


0.485 


upregulate stage 










S100 calcium-binding protein A2 






upregulate stage 








Hs,82128 








upregulate stage 






AA459397 








Oiie 
















0.174 


upregulate stage 






AA025418 






l!269 




upregulate stage 






AA023028 




"Homo sapiens Ig super^mfly receptor LN 




o!o49 


upregulate stage 




109166 


AA1 79845 


Hs.73625 


"RAB6 Interacting, kinesin-like (rabkine 




0.039 


upregulate stage 






AA227919 


"Hs.35962 


hyaluronan synthase 3 


1.737 


0.518 


upregulate stage 














0.021 


upregulate stage 
















upregulate stage 






AA253214 


Hs 198249 


"gap junction protein, beta 5 (connexin 


1.932 


0.502 


upregulate stage 




115697 


AA411502 


Hs,63325 




7.394 




upregulate stage 




115978 


AA447522 


"Hs.69517 


"Homo sapiens, clone l;flGC:5257, mtm, com 


1.667 


0.445 


upregulate stage 




116335 


AA495830 


Hs.41690 


desmocoilin 3 


4.899 


0.154 


upregulate stage 




11B314 


M63402 


H5.466g2 


ESTs 


9.75 


0.069 


upregulate stage 






N63604 


Hs.47166 


HT021 


4.601 


0.197 


upregulate stage 




119845 


W79920 


Hs.58661 


G protein-coupled receptor 87 


1.95 


0.123 


upregulate stage 




120486 


AA253400 


Hs.137589 


tumor protein 63 kDa with strong homoiog 


4.191 


0.211 


upregulate stage 






AA398470 


Hs.SSTSS 


'Homo sapiens cDNA: FLJ21245 Is, done 


14.25 


a058 


upregulate stage 




124059 


F13673 


Hs.283713 


'ESTs, Weakly similar to ORF YGLOSOw [S. 


4.99 


ai68 


upregulate stage 




128595 


U31875 


•Hs.1 52677 


"Homo sapiens cDNA FU20338 lis, clone H 


2.433 


a306 


upregulate stage 




128610 


L38608 


Hs.10247 


activated leucocyte cell adhesion molecu 


4.34 




upregulate stage 




129041 


H5B873 


"Hs.169902 


'solute carrier family 2 [facilitated gl 


2.003 


0.455 


upregulate stage 




129466 


L42583 


"Hs.1 11758 


keratin 6A 


11.584 


0.042 


upregulate stage 




130627 


L23808 


Hs.1 695 


matrix metalioproteinase 12 (macrophage 




0.233 


upregulate stage 




132349 


Y00705 


"Hs.181286 


"serine protease inhibitor, Kazal type 1 




ai32 


upregulate stage 




132710 


W93726 


Hs.55279 


"serine (or cysteine) proteinase inhibit 


3!888 




upregulate stage 






X57579 


Hs,727 


"inhibin, beta A (aclivin A, actlvin AB 




o!334 


upregulate stage 






AA242758 


"Hs.79136 


"LlV-1 protein, estrogen regulated" 


2.221 


0.387 


upregulate stage 






X07820 


Hs,2258 


matrix metalioproteinase 10 (stromelysin 


4.85 


0.03 


upregulate stage 










hypothetical protein DKFZp56401278 






upregulate stage 




400346 


AB041269 


Hs!272263 


Homo sapiens mRNA for keratin 19, partia 


8.95 


0.07 


upregulate stage 




400419 




Hs.81800 


chondroitin sulfate proteoglycan 2 (vers 


10.7 




upregulate stage 




400495 












upregulate stage 




400509 




Hs.155585 


receptor tyrosine kinase-like orphan rec 


1.52 


a5i 


upregulate stage 




400528 








3.47 


0.23 


upregulate stage 




400577 










0.29 


upregulate stage 




40060S 








7.2 


Q08 


upregulate stage 




400644 












upregulate stage 




400666 








1.42 


0.43 


upregulate stage 




400750 








8.7 




upregulate stage 




400773 












upregulate stage 




400844 








9]65 


ao4 


upregulate stage 




400845 








2.3 


0.28 






400846 








1.34 


as 


upregulate stage 




400880 








9.4 


0.06 


upregulate stage 
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PCT/US02/21338 



60 
65 
70 



400887 












upregulale stage 


401086 










0.51 


upregulate stage 


401093 








7 


0.08 


upregulate stage 


401101 










0.17 


upregulate stage 


401197 








5.18 


0.14 


upregulale stage 


401262 












upregulate stage 


401271 












upregulale stage 


401279 








9.1 


0.06 


upregulate stage 


401342 








1.42 


0.5 


upregulate stage 


401345 


M83738 


Hs.147663 


protein tyrosine ptiosphatase, non-recept 


1 


0.33 


upregulate stage 


401365 








6.5 




upregulate stage 


401395 










0.31 


upregulate stage 


401420 












upregulate stage 


401439 








Z66 


0.17 


Upregulate stage 










12 


0.05 


upregulate stage 


401599 


BE247275 


Hs.151787 


USsnRNP-specilic protein, 1 16 i<D 




0.08 


upregulale stage 


401600 


BE247275 


HS.1517B7 


US snRNP-spedlic protein, 1 16 i<D 


8.75 


0.09 


upregulate stage 


401694 












upregulate stage 


401747 








29.75 


0.02 


upregulale stage 


401759 








11.35 


0.06 


upregulale stage 










615 


0.07 


upregulate stage 


401858 


AC005261 


Hs.g8338 


serine/tlireonine liinase 13 (aurorayiPLI- 






upregulate stage 


401994 






ai6 


0.15 


upregulale stage 


402001 








44 


0.14 


upregulale stage 


402230 








8.75 


0.06 


upregulate stage 


402325 










0.36 


upregulate stage 


402408 








5.15 


0.1 


upregulale stage 


402472 








9.05 


0.08 


upregulale stage 


402480 












upregulate stage 


402490 








9.6 


0.07 


upregulate stage 


402553 








9.85 


0.09 


upregulale stage 


402889 








9.4 


0.09 


upregulale stage 


402901 








1.07 


0.85 


upregulate stage 


402938 












upregulate stage 


402995 








9.6 


0.06 


upregulate stage 


403005 








1.6 


0.21 


upregulale stage 


403020 








5.15 


0.12 


upregulate stage 


403052 


R5a624 


Hs,2186 


eul(aryotic translation elongation factor 






upregulate stage 


403053 


R58624 


Hs.2186 


eukaryotic translaHon elongation factor 


1.5 


0.28 


upregulale stage 


403073 










0.37 


upregulate stage 


403085 










0.43 


upregulale stage 


403105 








1.12 


0.57 


upregulate stage 


403152 


AA576664 


HS.3707B 


v-crk avian sarcoma virus CT1 0 oncogene 


0.86 


1.08 


upregulale stage 


403172 








7.7 




upregulale stage 


403212 








1.18 


0.62 


upregulate stage 


403214 








6.05 


0.1 


upregulale stage 


403277 








4.5 




upregulale stage 


403331 








32 


o!i3 


upregulate stage 


403361 








10.7 


0.05 


upregulale stage 


403485 








10.35 


0.08 


upregulate stage 


403588 












upregulate stage 


403851 








2.45 


0.34 


upregulale stage 


403860 










1 


upregulale stage 


403894 








445 




upregulate stage 


403903 








1.39 


0.58 


upregulate stage 


403954 


W28077 


Hs.79389 


nel(chicken)-like2 






upregulale stage 


404148 






9.15 


0.08 


upregulale stage 


404229 








1 




upregulate stage 


404258 












upregulate stage 










1.3 


0.2 


upregulate stage 


404288 










0.39 


upregulale stage 


404403 










0.28 


upregulate stage 


404440 








7.05 


0.06 


upregulale stage 


404507 










0.33 


upregulate stage 


404516 












upregulate stage 


404539 












upregulate stage 


404684 








0.89 


0.9 


upregulate stage 


404585 










0.26 


upregulate stage 


404704 








9.3S 


0.08 


upregulale stage 


404829 










0.24 


upregulate stage 


404860 








3.65 


0.15 


upregulale stage 


404894 








2.05 


0.16 


upregulale stage 


404939 










1 


upregulale stage 


405034 


AL035754 


Hs.2474 


toll-like receptor 1 




0.18 


upregulate stage 


405059 










0.56 


upregulale stage 


405064 










0.22 


upregulate stage 


405102 








9.65 


0.08 


upregulate stage 
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upregulale 



405494 
405520 
405526 
405725 
405738 



405906 
40S137 
406187 
406322 
406360 



408588 

406651- AI559224 

406665 U22961 

405671 AA129547 

406687 M31126 

406732 AA487229 

406747 A1925153 



406944 J04742 

405950 LI 7325 

406961 L77563 

406964 M21305 

■ 406993 S83249 

407017 U48697 

407073 Y10510 

407105 S64S99 

407128 R83312 

407132 T02871 

407137 T97307 

407158 N49839 

407175 T85503 



407192 AA609200 

407195 C21124 

407202 N58172 

407204 R41933 

407205 R78910 
407211 T95828 
407346 AI090210 
407422 AF1 16633 
407494 U10072 
407547 Y10259 
407564 AA042860 
407603 AW955705 
407634 AW016569 
407668 BE 161086 

407709 AA456135 

407710 AW022727 
407725 BE388094 
407729 T40707 
407774 AA084958 
407788 BE514982 
407811 AW1 90902 
407813 AL1 20247 
407833 AW955632 
407839 AA04S144 
407853 AA336797 

407881 AW072003 

407882 A1241264 



Hs.79389 

Hs.277477 

Hs.75442 

Hs.285754 

Hs,272620 

Hs.2064 

Hs.217493 

Hs,217493 

Hs.288036 

Hs.172928 

Hs.247945 



Hs.237260 
Hs.228523 EST 
Hs.199067 



major hislocompaliMily complex, class 



aaiiexiaA2 

tRNA isopenlenylpiytophosphate transferas 
collagen, type I, alpha 1 
gb:Human spleen PABL (pseudoaulosomal bo 
Human autonomous replicating sequence H1 
pre-T/NK cell assodated protein 
gb:Homo sapiens DGS-F partial mRNA. 
Human alpha satellite and satellite 3 ju 
gb;NG-TRA=lransporterp(oteln/pulatlveh 
gb:Human mariner-Ike element-containing 
gb:H.sapiens mRNA for CD67S protein. 

EST 



Hs.109370 ESTs 

Hs]272620 
HS.23CC7C 
Hs.264106 



v-ert>b2 avian eryihroUasiic leukemia v 
gb:yz08b10,s1 Soanes_miilllple_sclerDsis_ 
gb:yd87d1 2^1 Scares fetal liver spleen 
gb;zl74b11.s1 SoafesL.testis_NHT Homo sap 
gb:ae37e03.s1 Gessler Wilms tumor Homo s 
gb;af12e02.s1 SoarBS_teslis_NHT Homo sap 
gb:HUMGS0002072 Human aduR (K.Okubo) Ho 



HS.103C05 ESTs 



gb:Homo sapiens PR01318 mRNA, complete c 
gb;Human fbrkhead famly (AFX1) mRNA, pa 
H.sapiens ACm receptor mRNA 3UTR. 



ESTs, Highly slmilartoAI^41831 1 intra 



Hs,23616 
Hs.21857 
Hs.270862 

Hs,38991 
Hs.40098 
Hs,40109 
Hs.66666 
Hs.161566 
Hs,404gg 
Hs.40968 
Hs.62772 



gb:zn13d12j1 Stratagene hNT neuron (937 
S100 calcium-binding protein A2 
cysteine knot supei^lly 1, BMP antagan 
K1AA0872 protein 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale st^e 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale slage 
upregulale stage 
upregulale stage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale stage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 

upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
' upregulale slage 
upregulale stage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale slage 
upregulale stage 
upregulale stage 
upregulale slage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
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70 
75 



407910 AA650274 

407911 AF104922 

407912 AW104401 
407935 U31986 
407939 WG5608 

407944 R34008 

407945 X59208 

407946 AA226495 
407949 W21874 
407974 AW96ai23 
407983 U40371 
407994 AW135309 
408000 L1169C 
408014 AA723782 
408031 AA081395 
408046 AW139121 
408063 BE0a6548 
408092 NM.007057 
408101 AW968504 
408141 U89205 
408170 AW204516 
408184 AW168741 
408224 AW175997 
408239 AA063401 
408241 AW176546 
408268 AL138247 
408277 AW177959 
408306 BE141991 
408352 AA053875 
408360 AI8O6090 
408393 AW015318 

408442 Rsgeoa 

408514 AW206559 

408572 AA055611 

408617 R61736 

408633 AW963372 

408706 AW438503 

408713 NM_001248 

408725 AA131539 

408728 AL137379 

408738 NM.014785 

408739 W01556 
408754 N31256 
408766 AA057268 
408805 H69912 
408813 AI580090 
408817 AA524525 
408849 BE219451 
408902 AW014869 
408908 BE296227 
408916 AW295232 
408933 AA058979 
408943 NM,007070 
408960 BE158389 
409032 AW301807 
409093 BE243834 
409099 AK000725 
409142 AL1 36877 
409203 AA780473 
409231 AA446644 
409262 AK000631 
409357 M73628 
409402 AF208234 
409405 AA075869 
409408 AW387837 
409420 Z15008 



409576 AW419225 

409582 R27430 

409632 W74001 

409642 AW450309 

409674 A1935146 



fibronecUn leucine rich transmembrane p 



HS.9531C 
Hs!23165 



Hs.182133 
Hs!300g76 



ESTs, Weakly slnillartoAF151881 1 CGl-1 
cartilage paired-class homeoproteln 1 
gb:zaa5e07.r1SoaresJetalJung_NbHL19W 



Hs.41296 
Hs.243489 

Hs.239727 
Hs.606 
Hs.154292 

Hs]l46401 
Hs.41718 
Hs.244331 
Hs.620 
Hs.41749 
Hs.42173 
Hs.183643 
Hs.4234e 
Hs.42660 
Hs.123073 
Hs.45152 
Hs.31836 
Hs.22249 

gb:QV0-BT0078-1 90899-005-E02 BT0078 Homo 
Hs.271827 ESTs, Moderately similar to ALU7_HUMAN A 

gb;MRO-CT0063-200899-001-a01 CT0063Homo 
gb:DKFZp547D237_r1 547 (synonym: hfbri) 
gb;lL3-HT0060-200899-OOB-D03 HT0060 Homo 
gb;PM2-HT0134-220999-0O2-d10 HT0134 Homo 



ATPase, Cu-w- transporting, alplia polypep 



bullous pemphigoid antigen 1 (230/240kD) 
protein kinase, cGk/IP-dependent, type II 
Homo sapiens cDNA aJ10366 lis, clone NT 
ESTs 

cateineurin-binding protein calsarcin-1 

ZWIOinteraclor 

CDC2-related protein kinase 7 

ESTs, Moderately similar to neurogenfc b 



ESTs 



hypotheOcal protein FLJ20534 



ESTs, Moderately similar to 



Hs.46677 
Hs.256935 
Hs.47042 
Hs.15669 
Hs.47125 
Hs.47313 
Hs.238797 
Hs.1 61623 
Hs.146013 
Hs.48269 
Hs.48295 
Hs.279864 
Hs.254919 



eclonudeoside triphosphate diphosphohyd 



serine/threonine kinase 15 



ESTs 
ESTs 

CGI-04 protein 

liypolhetlcal protein F1J20718 
chromosome-associated polypeptide C 
cytoclirome P450, subfamily !VB, pciypept 
tumor-associated calcium signal transduc 
hypothetical protein FU20624 
casein, kappa 
cystalinB(steflnB) 

ESTs, Highly similar to RL39.HUMAN 508 R 
gb;MR4-ST0118-0212gW)21-fl)8 ST0118 Homo 
lamlnin, gamma 2 (nicein (lOOkD), kalini 



Hs.256247 
Hs.271565 
Hs.5527g 
Hs.257347 
Hs,278611 
Hs,246042 
Hs.56009 
Hs.94149 



upregulate stage 
upregulaleslage 
upregulate stage 
upregulaleslage 
upregulate stage 
upregulate stage 
upregulaleslage 
upregulate stage 
upregulate stage 
upregulaleslage 
upregulaleslage 
upregulaleslage 
upregulaleslage 
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upregulaleslage 



upregulate stage 
upregulate slage 
upregulate slage 
upregulate stage 
upregulaleslage 
upregulate slage 
upregulate stage 
upregulaleslage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulaleslage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulaleslage 
upregulate stage 
upregulate slage 
upregulate stage 
upregulate stage 



upregulate slage 
upregulate slage 
upregulate slage 
upregulate stage 
upregulate stage 

upregulate slage 
upregulate slage 
upregulate stage 
upregulate stage 
upregulaleslage 
upregulaleslage 
upregulate slage 
upregulate slage 
upregulaleslage 
upregulate slage 
upregulate slage 



106 



wo 03/003906 



PCT/LS02/21338 



10 
15 
20 



45 

50 

55 





AA302840 
BE256027 


Hs. 180946 


gb:EST10534 Adipose tissue* white 1 Homo 










AWB85691 




gb:RC4.OT^1-240300-013-b04OT0071 Homo 










AWB05510 




hypottiefical protein FU21634 








409985 


AW291944 


Hs!l22139 


ESTs 


435 


o:i4 


upregutale stage 




AW960597 


Hs.30164 


























BE566742 


Hs.58169 












AW582568 




gb;RC1-ST0278-080100-01 1-h04 ST0278 Homo 










AW248508 


Hs. 279727 


Homo sapiens cDNA FLJ14035 fis, clone HE 










AW590540 














AK001586 




hypothetical protein FLJ10724 










BE311926 




Hcffno sapiens cDNA FIJ12691 fis, clone NT 










A)468210 


Hs.261285 


pleiotropic regulator 1 (PRL1, Araixdops 










At 936442 




h/potheiical protein FU 10808 
























AK000337 














AW614396 




























BE244668 














AA831460 












410399 


BE068889 














AA224053 




ESTs, Moderately similar tc 152835 H-NU C 










AA310600 




























AW749927 




gb:QV0-BT0537;231299-049---0rBT0537 Hono 
























A1675688 














AW975746 


















gb;IL3-CT0219.281099'024-A03 CT0219 Homo 










AW9051 38 




gb:QVO-NN1071-280400-207.g07 NN1071 Homo 














gbiPMI -CT0247-C80100-008-e10 CT0247 Homo 










AW016824 














N29220 




gb'.yx43b05.r1 Soares melanccyie 2NbHM Ho 
























AW858528 




gb:CM3-CT0341-15030O.119-h11 CT0341 Homo 
























AW388653 


Hs.266859 


























BE379794 


Hs.65403 












AW368860 


Hs.293950 










AA357918 




gb:EST66726 Fetal lung til Homo sapiens 














gbvc16b10.s1 Stratagenelung (937210) H 
























AW978159 


Hs.250164 


ESTs, Wealdy sin^lar to coded tor by C. 










AW504860 


Hs.288836 


Homo sapiens cDNA FU 12673 fis, done 






















U64820 














































gb:MR4.ST0125.021199-017-d08ST0125 Homo 














gb:RC0-ST0174-21 1099-01 1-h12ST0174 Homo 




























gbiRCS-STOI 86-3001 00-017-e04 ST0186 Homo 




















































AW819191 




gb:CMl-ST0283-071299-061-d08 ST0283 Homo 
























AW819867 




gb;QVO-ST0294-070300.1 51-f02 ST0294"Homo 








411159 


AW820178 




gb:QV0-ST0294-100400.185-e07 ST0294 Homo 


1 


0^27 


upregulate stage 


411170 


AW820503 




gb:QV2-ST0298-1 40200-042-b05 ST0298 Homo 






upregulalB slags 


411193 


AW821484 




gb:lL2-ST031 1-211299-028-F12 STC311 Homo 




0.24 


upregulate stage 


411242 


BE146808 




gb:QV4-HT0222-181099.013-g03HTC222 Homo 


2.55 


0.26 


upregulate stage 


411245 


AW333441 




gb:QV4-TT0008-271099-020-g01 TT0008 Homo 


9.62 


0.09 


upregulate stage 


411263 


BE297802 


Hs.69360 


kinesin-lko 6 (mitotic centromere-assoc 


2.4 


0.32 


upregulate stage 


411282 


AW995011 




gb:QV0-BN0040-170300-161-d07 BN0040 Homo 






upregulate stage 


411284 


N285ig 


Hs.1 35191 


ESTs, weakly sWiar to unnamed protein 


3.25 


0.12 


upregulate stage 


411294 


AW859729 


Hs.42680 


ESTs 






upregulate stage 


411327 


AW836922 




gb:QV1-LT0036-1 50200-074^106 LT0036 Homo 




0.37 


upregulate stage 


411338 


AW731782 


Hs.1 16122 


ESTs, Weakly similar to unnamed protein 


5 


0.13 


upregulate stage 


411339 


BE164Sg8 




gb:RC3-Hr0470-120200-013-b10 HT0470 Homo 




0.25 


upregulate stage 


411383 


AA001394 


Hs.69749 


KIAA0087 gene product 


3.6 


0.18 


upregulate stage 


411387 


AW842339 


Hs.130815 


hypothetlcai protein FU21870 


8.75 




upregulate stage 


411400 


AA311919 


Hs.69851 


GARI protein 


12.1 
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411425 AW846012 

411461 AW847937 

411526 AW850327 

411560 AW851186 

411568 BE144593 

411571 AA122393 

411605 AW006831 

411626 AW793453 

411630 U42349 

411643 AI924519 

411653 AF070578 

411771 AW993247 

411787 AW863568 

411788 AW897793 
411826 AW947946 
411835 U29343 
411860 T89420 
411874 AA096106 



411943 BE502436 

411946 AL033527 

411991 X58822 

412040 086519 

412088 AI689496 

412134 AW895560 

412140 AA219691 

412231 AW902491 

412296 AW935233 

412327 AW937355 

412357 AW939537 

412359 AW837985 

412367 AW945964 

412529 BE271224 

412537 AL031778 

412547 W27161 

412559 T31474 

412636 NM.004415 

412648 AA1 15211 

412668 AA456195 

412671 AW977734 

412673 AL042957 

412723 AA648459 

412739 AA116018 

412744 N31101 

412778 AA120882 

412811 H06382 

412838 D61870 

412854 BE004149 

413076 D69828 

413109 AW389845 

413117 BE066107 

413119 BE065941 

413141 BE166323 

413219 AA878200 

413228 AA127518 

413273 U75679 

413278 BE563085 

413294 BE144034 

413324 V00571 

413342 AAl 28536 

413430 R22479 

413707 BE158679 

413743 BE161004 

413753 U17760 

413786 AW613780 

413792 BE166924 

413804 T64682 

413833 Z15005 

413854 BE17430Q 

413918 AW015B98 

413968 AW500374 

414091 T83742 

414099 U11313 



gb:RC2-CTC163-2309gg-003-E01 CT0163 Homo 
gb:IL3-CT0213-21 0200-042-D02 CT0213 Homo 
gb:IL3-CT021 9-221 199-029-008 CT0219 Homo 
gb:IL3XT0220-16020C-071-H05 a0220 Homo 
gb:MR0-HT0167-141199-002-fD4HT0167Homo 
hypothetical protein FU20516 
ESTs 

Kl AAl 229 protein 

Putative prostate cancer tumor suppresso 
Homo sapiens oDNA; FLJ22028 Cs, clone H 
Homo sapiens clone 24674 mF!NA sequence 
gb:CM0-CT0341.26010O.160-f10 a0341 Homo 
gb:RC2-BN0033-180200-014-h09 BN0033 Homo 
gb:MR3-SN001G-240300-102-c10 SN0010 Homo 
gb;CM1-NN0063-280400-203-l07 NN0063 Homo 
gb:PM0^iJT0011-240300-00'1-a09 MTOOII Homo 



Hs.266273 
Hs.266273 
Hs.797 



HS.3184S 
Hs.179912 
Hs.271809 



Hs.118727 
Hs.1 95870 
Hs.75257 



Hs.75294 
Hs.24550 



gb:ydg81D4.s1 Soarestetal liver spleen 
ESTs 

Homo sapiens cDNA: FU22555 lis, clone H 
adaplor-relaled protein complex 2, alpha 
gb;RC3-PT0028-1SC100-012-h02PT0023Ho 
ESTs, Weakly similar to putative [Celeg 
v-rayc avian rayelocytomatosis viral oncog 



Hs,108932 ESTs 



lorY6 



ck90kDpr( 



1, alpha 



gb:QVO-DT0020-090200-107-a06 OTC020 Homo 
gb:QV3-DT0043-21 1299-044-306 DTCa43 Homo 
gb;QVl-DT0072-l 10200-066-105 OT0072 Homo 
gb;QV3-LT0048-140200-oa3*05 LT0048 Homo 
gb;QV0-ETQ001-0605(M-228-e09 ETOOOl Homo 
Homo sapiens cDNA FU13346 lis, clone OV 
Homo sapiens cDNA FU13346 frs, done OV 
nuclear transcription factor Y, alpha 
gb;23al 2 Human retina cONA randomly prim 
gb:EST33147 Human Embryo Hoirx) sapiens cO 
desmoplakin (DPI, DPIl) 
EST 



ESTs 10.75 0.07 

gb:EST389963 MAGE resequences, MAGO Homo 2.66 a3 



Hs.13500 ESTs 



gb:HUM218F1 1 B Oontech human aorta polyA 



ESTs, Weakly similar to ALU1_HUMAN ALU S 
gb;RC3-BT031 9-1001 00-012-d12 BT0319 Homo 
gb:QV4-HT0492-27010a-086-e12 HT0492 Homo 
Homo sapiens cDNA FU13e92 fis, clone PL 
ESTs 

Hairpin binding protein, histone 
inlerferon-stimulated protein, 15 kDa 
gb:MR0-HT01 65-191 199-004-302 HT0165 Homo 
corticotropin releasing homnone 
gb:zl24e04.r1Soares_pregnanLuterus.NbH 
Homo sapiens cONA FU13047 lis, ckxie NT 
gb:avlO-HT0395-280100-169-c04 HT0395 Homo 
gb:PIVI0-HT0425-1701OO-002-h03 HT0425 Homo 
Human DNA sequence from clone 272L1 6 on 



gb:CM4-HT0501-240300-519-f01 HT0501 Homo 
gb:yc48b02.r1 Stratagene liver (937224) 
centromere protein E j312kD) 
heat shock protein hsp70-rElated protein 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
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upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulstestsge 



upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
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414116 AA687370 

414127 AI431863 

414169 AA136169 

414275 AW970254 

414304 AI621276 

414338 N80751 

414447 AA147549 

414494 AA768491 

414520 AA148806 

414569 AF1 09298 

414575 H11257 

414597 H67472 

414643 H46177 

414658 X58528 

414661 T97401 

414683 S76296 

414735 BE468016 

414737 A1160386 

414747 U30672 

414774 X02419 

414783 AW069569 

414799 A1752416 

414833 T07114 

414883 AA926960 

414885 AA157531 

414918 AI219207 

414985 C17372 

415025 AW207091 

415033 D31476 

415060 AJ223810 

415058 Z19448 

415095 059592 

415099 AI492170 

415104 D60076 

415114 D60468 

415138 C18356 

415139 AW975942 
415143 236953 
415153 C0350a 
415178 D80503 
415217 H23983 
415227 AW821113 
415238 R37780 
415241 F02208 
415295 R414S0 



415330 244693 

415336 T77664 

415337 Z44881 
415352 F06565 
415364 FC6771 
415371 R15239 
415412 F08049 
415451 H19415 
415462 R52692 
415496 R37637 
415509 R4CC00 
415511 AI732617 
415542 R13474 
415569 Z43930 
415600 F12664 
415616 F12945 
415626 243847 
415635 F13168 
415750 AA167712 
415786 AW419196 
415788 AW628686 
415790 R23574 
415799 AA653718 
415837 H05279 
415857 AA866115 
415905 AI751357 

415947 U04046 

415948 AA262226 
415979 H16427 
415989 A1267700 



Hs.301471 ESTs 



ls.204046 ESTs 



Hs.295233 ESTs 



Homo sapiens cDNA: FU22724 lis, do 



Hs,21929 
Hs.76888 
Hs.281904 
H6,125087 
Hs.77204 
Hs.77274 
Hs,75839 
Hs,77326 



Hs.72402 
Hs.21422 
Hs.27214 
Hs,6546 



Hs.267924 
Hs.78851 
Hs.23545 
Hs.225841 



ATP-blnding cassette, sub-family D (AID) 



centromere protein F (350/400kD, mitosln 



zinc finger protein 6 (CMPX1) 
Insulln-Iike growlii factor binding prote 
gb£ST05003 Fetal brain, SIratagene (cat 
Hs,7756C COC28 protein kinase 1 



Hs.301 448 Homo sapiens cDNA FUI 21 52 lis, clone MA 
Hs,43213 ESTs, Weakly simllarto lEFS.HUMAN TRANS 
Hs,131887 ESTs, Weakly simllarto ORF YNL227C |S.c 
Hs,34746 ESTs 

Hs,77917 ubiquitin car1»xyl-termlnal esterase L3 

gb.'HUM084E10A Cbniech human fetal brain 
gb:HUM1 1 1 A06B Clontech human fetal brain 

Hs.78045 tissue factor pathway Inhibitor 2 

Hs.48524 ESTs 

Hs.48527 ESTs 



gb;HUM0B0A02B Human fetal brain (TFu]lwa 



gb:HSC01A011 noimalized infant brain cDI4 



Hs,52132 
Hs.268720 
Hs.12698 
Hs. 12286 
Hs.91968 
Hs,1 82362 
Hs,290263 



gb:HSC1CG051 normalized Infant brain cDN 
gb:HSC1 K[X)31 nomialized Infant brain cDN 
gb:yf89b02,r1 Scares Infant brain 1 NIB H 



Hs.12294 ESTs 



gb:HSC1 MC051 normalized infant brain cDN 
gb:HSC3JF101 nomialized Infant brain cDN 
gb:zq39g08.s1 Stratagene hNT neuron (937 
Homo sapiens cDIMA FU13782 Us, clone PL 

KiAA0217 protein 
ESTs 

DKF2P434D193 protein 
ESTs 

Home sapiens cDNA FLJ11381 ns, clone HE 
Homo sapiens cMh FLJ22256 lis, ctone H 
mutS (E. coK} homolog 2 (colon cancer, 
gb:zs24h06,r1 NCLCGAP.GCBI Homo sapiens 
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upregulate stage 
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upregulate slage 
upregulate slage 
upregulate stage 
upregulate slage 
upregulate slage 
upregulate stage 
upregulate slage 
upregulate stage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate stage 
upregulate stage 



109 



wo U3/003906 



PCT/US02/21338 





AW1 38239 












416052 


R12816 


Hs,'21164 


ESTs 


1.45 


0.24 


upregulate stage 














































BE387371 




Homo sapiens cDNA: FLJ21017 fis, clone C 
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ESTs 
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v-yes-1 Yamaguchi sarcoma viral related 
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L24203 


Hs!82237 


ataxia-telangiectasia group D-associaled 
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upregulate stage 
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ESTs 






upregulate stage 
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AA339449 


Hs.82285 


pbosphoribosylgiycinainide'ormylt'ansfer 
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upregulate stage 
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R26968 




ESTs, Weakly similar lo ALU7.HUMAN ALU S 


9.15 
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upregulate stage 
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Hs'.226823 


ESTs 


4.35 


0.14 


upregulate stage 
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AA204688 




ESTs, Wealily similar to ALU7_HUMAN ALU S 


0.94 
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upregulate stage 
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R02530 


Hs.191198 


ESTs 
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upregulate stage 
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R12490 


Hs.189779 


ESTs 
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upregulate stage 
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T05870 


Hs.100540 


ESTs 
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upregulate stage 
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AW969587 


Hs.86366 


ESTs 
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upregulate stage 
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AA205525 


Hs.208057 


ESTs 


4.65 




upregulate stage 


417742 


R64719 




gb:EST22d1 1 WATM1 Homo sapiens cDNA clon 


4.15 
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upregulate stage 
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A1267720 


Hs^23 


neuroblastoma RAS viral (v-ras) oncogene 


9.98 
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upregulate stage 
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Z43482 


Hs.82772 


collagen, type XI, alpha 1 
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upregulate stage 
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Hs^67054 


ESTs 
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ESTs 
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Homo sapiens cDNA: FLJ23186 fis, clone L 
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reuromedin B 






upregulale stage 
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ESTs, Weakly similar to prolyl 4-hydroxy 
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Hs,83724 


Human clone 23773 mRNA sequence 
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Hs.98445 


Homo sapiens cDNA: FU21652 fis, clone C 
CDC28 protein kinase 2 
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upregulale stage 
upregulale stage 
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Hs.283099 


AF15q14 protein 
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hypothetical protein FIJ20093 
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gb£ST1870g5Cobn carcinoma (HCq cell 
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gb:nc32g04.s1 NCLCGAP Pr2 Homo sapiens 
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gt);no21ti04.r1 NCt_CGAPrPr1 Homo sapiens 
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Hs.87013 


Homo sapiens cDNA FU10238 lis, done HE 
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upregulate stage 
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bone morphogenetic protein receplort typ 
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Hs.87409 


Ihrornbospondini 
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insulinoma-associated 1 
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Hs.262346 


ESTs, Weakly similar to 0RF2: liiiiction u 
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upregulale stage 
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cytochrome P450, subfamily XXIV (uitanin 


3.55 
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upregulalestage 
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AI248073 


Hs!l88723 


ESTs, Weakly similar to ALU1_HUIV1AN ALU S 
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AI342491 


Hs.87413 


ESTs 
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upregulalestage 
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Hs,288433 
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up.'egulate stage 
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Hs.221303 


ESTs 
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upregulalestage 
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AA521504 


Hs.190179 


ESTs 






upregulalestage 


419355 


AA428520 




progesterone binding protein 


10.6 


0.06 


up.'egulate stage 
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AL043202 


Hs!90073 


chromosome segregation 1 (yeast homolog) 




0.47 


upregulalestage 
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AA237040 


HS.875B9 


ESTs 






upregulate stage 
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AA991639 


HS.1S036 


ESTs, Highly shiilar to AF161358 1 HSPCO 
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upregulate stege 
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Hs.90572 


PTK7 protein tyrosine kinase 7 
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upregulate stage 
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AW978036 




gb:EST390147 MAGE resequences, MAGO Homo 
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upregulate stage 
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AA243420 


Hs.87648 


ESTs 
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nb-od03g07.s1 NCLCGAP_GCB1 Homo sapiens 
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Homo sapiens cDNA FLJ13901 lis, clone TH 




















upregulale stage 
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Hs,104318 
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KIAA0480 gene product 
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upregulale stage 
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gb:EST379019 MAGE resequences, MAGJ Homo 
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Hs.220697 
















gb:zsC3g12.s1 NCLCGAP.GCBI Homo sapiens 






upregulate stage 
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Hs.120219 








upregulale stage 
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Hs.118923 
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upregulate stage 
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gb:zs26d09.r1 NCLCGAP_GCB1 Homo sapiens 






upregulate stage 
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upregulate stage 
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gb:2s21h11.r1 NCLCGAP_GCB1 Homo sapiens 






upregulate stage 




AA809860 
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Hs.99785 


Homo sapiens cDNA: FLJ21 245 lis, done C 






upregulate stage 
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Hs.175319 
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hypothetical protein 
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AL034344 


Hs.298020 


Homosapiens cDNA FLJ11796 fis, clone HE 


10.35 


0.06 


upregulate stage 
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Hs.291111 


ESTs 
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upregulale stage 
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gb:zs35h07.r1 NCLCGAP GOBI Homo sapiens 
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Hs.88323 
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KIAA0328 protein 
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Hs.193514 
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Homo sapiens mFM; cDflA DKFZp564H0616 (f 
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liypothetical protein 
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ESTs 
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Hs.293736 
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upregulale stage 
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U22376 
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AA281062 
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AI809621 


HS.105S20 








upregulate stage 
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gb:zt10e01.rl l<ICI_CX3AP_GCB1 Homosapiens 






upregulate stage 
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tmo- necrosis ao r.ap a- uoe pro 
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AW351839 


Hs.124660 


Homo sapiens cOHH. FJ21763 lis, clone C 
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upregulate stage 
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Hs,89217 


ESTs 
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ESTs 




0,1 


upregulale stage 
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ESTs 
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upregulate stage 
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AI471925 


Hs!89257 


ESTs 
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H87879 


Hs.102267 


lysyl oxidase 
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upregulate stage 
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AW978316 


Hs.1 36649 


ESTs 
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upregulate stage 
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NM_014721 


Hs.)02471 


KIAA0680 gene product 


5.7 
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upregulate stage 
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NM 000499 


Hs.72912 


cytochrome P450, subfamily 1 (aromatic c 


0.07 
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upregulale stage 
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AW276914 


Hs.300877 


ESTs 
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upregulate stage 
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AI0S6S90 


Hs.7086 


Homo sapiens cDNA: FLJ23000 fis, clone L 
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upregulate stage 
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AA60a853 
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ESTs 
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upregulale slage 
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gb;ob39a05.s1 NCI_CGAP_GCB1 Homo sapiens 
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small inducible cytokine subfamily B (Cy 
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gb;EST71314 T-oell lymphoma Homo sapiens 
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Ste2a.related serinefthieonine kinase 










BE465451 
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single-minded (DrDsopiiila) homobg 1 
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upregulale stage 
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gb;CM0-CT0341-181299-130-ii12 CT0341 Homo 
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upregulale slage 










197 




upregulate slage 




AW381089 


Hs.1 08806 
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upregulale stage 
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Hs. 109050 
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acldc epIdUymal glycoproteln-like 1 
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51.9 
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Hs.110488 


KIAA0990 protein 
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■<IAA0712 gene product * 
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Hs.272027 
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AL043892 


Hs.180582 


Homo sapiens cDNA: FLJ21 335 fis, clone H 
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AA339015 
AA307595 










upregulate slage 




AB038995 




RAB-8b protein 
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gb:EST182167 JurkatT-cells V Homosaple 
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ATP-blndIng cassette, sub-family D (ALD) 






upregulde stage 
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Hs.118110 
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Home sapiens MK FLJ23548 fls, clone L 






upregulale stage 
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upregulale stage 
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Hs.1 19155 


Homo sapiens mRI^ cDNA DKFZp434624g (fr 
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hypotbettcal protein FLJ10549 
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Hs.121553 


hypothetical prolan FLJ20641 
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gb:EST25a73 Cerebdium 11 Homo sapiens c 
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AW864848 




gb:PM2-SN0018-290300-003-c09 SN0018 Homo 
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upregulate stage 
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NMJ06479 


Hs.126067 


frequently rearranged In advanced T-cell 


9.75 
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upregulale slage 
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Hs.3327 


Homo sapiens cDNA; FLJ2221 9 fis. clone H 
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upregulate stage 
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BE006775 


Hs.1 26782 


sushi-repeat protein 


1.58 
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upregulate slage 
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AI660412 




ESTs 






upregulate slage 
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NM 014170 


Hs!l274g6 


HSPC135 protein 




o!69 


upregulale slage 
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AA364195 




gb:EST75015 Pineal gland II Homo sapiens 




0,95 


upregulale slage 
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A1471609 


Hs.64347 


ESTs 


3.95 


0.14 


upregulate stage 
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AF112481 


Hs.1 28501 


RAD54. S. cerevisiae, homolog of, B 


1.62 


0.43 


upregulale stage 
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R6864g 


Hs.278359 


absent in melanoma 1 like 


6.25 
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upregulate stage 
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Hs.128791 


CGI-OS protein 


8.46 


0.09 


upregulate slage 
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Hs.129705 
Hs.222830 


ckine 1900 unknown protein 
ESTs 
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Hs,18612 


Homo sapiens ct^NA: FU21909 Is, clone H 


3.4 
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upregulale slage 
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gb;EST32368 Embryo, 12 week 1 Homo sapie 
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upregulale slage 
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AW452650 


Hs.157148 


Homo sapiens cDNA FLJ11883 lis, clone HE 
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423748 AI149048 

423753 Y11312 

423758 AA338153 

423774 L39064 

423818 AA332439 



423827 

423837 AW937063 

423912 BE081233 

423938 AL049328 



Hs.172625 ESTs 



423946 AL137344 

423956 W28203 

424006 AF054815 

424008 R02740 

424012 AW368377 



424353 AA339645 

424364 AW383226 

424406 D54120 

424420 BE614743 

424426 AB0314B0 

424486 BE0O2477 

424490 AJ278016 

424492 AI133482 

424505 AA446131 

424513 BE385864 

424575 AL110217 

424583 AFQ17445 

424589 AW854298 

424602 AK002055 

424625 AW904466 

424629 M9C656 

424643 AF241B50 

424649 BE242035 

424653 AW977534 

424670 W61216 

424690 BE538356 

424701 NM_005923 

424702 AF260237 
424717 H03754 

424735 U31875 

424736 AF230877 
424749 NM_002451 
424834 AK001432 
424841 AI280215 
424860 W60828 

424878 H57111 

424879 AA348013 
424888 AA348123 
424906 NM_0024S7 
424930 AAB85344 
424948 AA348810 
424951 AW964082 
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425020 U09368 
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425057 AA826434 
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426216 U81504 



Hs,135642 
Hs.1 35723 
Hs,1282S1 
Hs.1 35692 
Hs.136169 
Hs.137648 
Hs.137555 
Hs.1 37569 
Hs.138959 
HS.22763Q 
Hs.21638 
Hs.142868 



Hs.149894 
Hs.150761 
Hs.150926 

Hs.301129 
Hs.151310 
Hs.l 51393 
Hs.151428 
Hs.151461 
Hs.1 51469 
Hs.116861 
Hs.151777 
Hs.151988 
HS.152C09 
Hs.152213 
Hs,1 52677 
Hs.1 62701 
Hs.1 52817 
Hs.1 53408 
Hs.96885 
Hs.1 53529 
Hs.221132 
Hs15g354 
Hs.24882 
Hs.1 53704 
Hs.96gi0 
Hs.1 90503 



PL 



gb:PM3-DT0037-231299-001-g11 DT0037 Homo 
gb:PMO-BT0725-300300-001-H07 BT0726 Homo 
Homo sapiens mRNA; cDNA DKFZp564E02S (fr 
glycollpid transfer protein 
phosplxxiiesterase 10A 

Homo sapiens mRNA; cDNA DKFZp761l1311 (f 
Homo sapiens clone 25215 mRNA sequence, 
CD84 antigen (leukocyte antigen) 
putative cliemoklne receptor; GTP-blndIng 
tumor protein 63 kDa with strong homokig 
gap junction protein, alpha 7, 45kD (con 
RE1-sllenclng transcription factor 
ESTs 

Homo sapiens CDNAFU11143 Us, 
gb:EST44755 Fetal brain I Homo sapiens 
ESTs, Weakly similarto DRPIA lH.sapiens 
wingless-type MMIV integration site fami 
prostaglandin E synthase 
SPR1 pnjlein 

chloride intracellular channel 6 
ankyrin repeal domain 3 
ESTs 

Homo sapiens cDNA FLJ13186 Us, clone NT 
mitochondrial transiational initiation f 
DKFZP572C1 63 protein 
fucose-1 -phosphate gtianyiyitransferase 
glj:RC3-CT0254-100500-21 1-c03 CT0254 Homo 
Homo sapiens clone 23869 mRNA sequence 
PDZ domain protein (Diosophiia inaO-IIke 
glutamate-cystelne llgase, catalytic sub 
ret finger protein 2 



caicium/caimodulin-dependent serine piot 



Humai transialion initiation fector elF< 
mitogen-activated protein kinase kinase 
6 protein-coupled receptor 85 
wingless-type (iJMTV integration site fami 
Homo sapiens cDNA FU20338 fis, done HE 
microtubule-interacting protein that ass 
methylthloadenosine phosphoiyiase 
Homo sapiens cDNA FU10570 Us, done NT 
ESTs 

Homo sapiens done 24540 mRNA sequence 



N IMA (never in mitosis gene a|-reiated k 



gb;EST378155 lulAGE lesequences, MAGH Homo 
gb:HSC2CF021 nonmaized Infant brain cDN 
zinc nnger protein 140 (done pHZ-3g) 



ls.154443 

ls.155065 
HS155172 
Hs.165909 ESTs 



KIAA0103genepiadud 



425289 AW139342 
425304 AA463844 
425316 AA354977 



ESTs, tftiderateiy simitar to NSD1 protein 
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upregulate stage 
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3,08 


0.15 


upregulale stage 






upreguiate stage 






upreguiate stage 


|.44 


0.55 


upreguiate stage 




































































1.19 


0.67 


upregulale stage 


1.42 




upregulale stage 


















































2!85 


0.2 


upreguiate stage 
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upregulale stage 


141 












































































































3.2 


0^12 


upregulale stage 


8.75 


0.09 


upregulale stage 






upregulale stage 






upregulale stage 


2.65 


0.13 


upreguiate stage 




0.22 


upregulale stage 


9.5 


0.07 


upregulale stage 




0.52 


upreguiate stage 




0.32 


upreguiate stage 


7.05 


0.1 


upreguiate stage 


19.7 


0.04 


upreguiate stage 




0.69 


upreguiate stage 


10>15 


0.05 


ifpregulafe stage 


1.57 


0.51 


upreguiate stage 


5.05 


0.12 


upreguiate stage 
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425403 AL023753 

425415 M13903 

425420 BE536911 

425463 AK000740 

425465 L18964 

425467 R16484 

425492 AL021918 

425607 U09860 

425608 AA360486 
425614 AI334953 
425641 D79758 
425660 AA521184 
425665 AK001050 
425672 AA361483 
425707 AF1 15402 
425726 AF085808 
425742 AJ001454 
425785 T27017 
425811 AL039104 
425843 BE313280 
425852 AK001504 
425883 AL137708 
426010 AA1 36563 
426028 NM.OOmO 
426101 AL049987 
426108 AA622037 
426115 H08895 
426168 NM_003152 
426257 AL137201 
426261 AW242243 
426283 NM_003937 
426451 AI908165 
426462 U59111 
426490 NM_001621 
426514 BE516633 
426561 AA381437 
426711 AA383471 
426731 AW303411 
426759 AI590401 
426786 AA319798 
426788 U66615 
426818 AA554827 
426824 D87717 
426827 AW067805 
426921 AA037145 
426936 NM_000088 
426997 BE620738 
427071 AA397958 
427126 AA620613 
427134 AA398409 
427142 AA398510 
427259 AA400096 
427308 D26067 
427315 AA1 79949 
427356 AW023482 
427370 AI243515 
427376 AA401533 
427387 BE244966 
427470 AWg99924 
427519 AW085233 
427521 AW973352 
427528 AU077143 
427566 AI743515 
427581 NM_014788 
427585 D31152 
427603 AI090838 
427646 AI678042 
427652 AI673025 
427742 AA411880 
427814 W28383 
427839 AA608823 
427878 C05766 
427922 AK001934 
427933 AW974643 



Hs.155537 

Hs.156346 
Hs.1 56406 
Hs.1 57091 
Hs.234545 
Hs.157986 

Hs.'l90075 
Hs.1 581 74 
Hs.158333 
Hs.92443 
Hs.156256 
Hs.14355 
HS.1D5504 
Hs.159066 



Hs.159627 

Hs.159651 

Hs.161031 

Hs.1975 

Hs.1 72028 

Hs.155361 

Hs.1 66468 

Hs.165733 

Hs.157503 



protein kinase, DNA-adivated, catalytic 
gb:EST64410 JurkalT^ds VI Honnosapla 
lopoisomerase (DIvIA) II alplia(170kD) 
Human DMA sequenoalinra clone 1198H6 on 

ESTs, Weakly similar lo API 55135 1 novel 
hypcthet'cal protein FLJ20733 



Hs.1 8 
Hs.1 30332 
Hs.21213 
Hs.1 72247 
Hs.172280 
Hs.124841 
Hs.172652 
Hs.172665 
Hs.172865 
Hs,172928 
Hs.1 731 25 
Hs.192719 
Hs.1 91 327 
Hs.173561 



Hs.174905 
Hs.l 75563 
Hs.97849 
Hs.97740 
Hs.1 9440 
Hs.1 77584 
Hs.1 78357 



Hs.1 79703 
Hs.1 79729 
Hs.98006 
Hs.271953 
Hs.43874 
Hs.1 90888 
HS.1B0900 
HS.9B244 
HS.1B1022 
Hs.181112 
Hs.190571 



Homo sapiens cDNA FU13207 lis, clone NT 
ESTs 

liypothetlcal protein FU1C188 
gb:EST70790 T-cell lymphoma Homo sapiens 
E744lke factor 5 (ets domain transcript 



Hs.159425 testican 3 



Homo sapiens ck>ne 24400 mRIM sequence 
karyopiierin alpiia 2 [RNS colnrt 1, impor 
deat)) associated piotein 3 
deati) receptor 6 

Homo sapiens mRNA; cDNA DKFZp434K0322 (f 
Homo sapiens cDW: FU21007 lis, clone C 
a disintegrin and melslloproleinase doma 
Homo sapiens mRNA; cDNA DKFZp564F112 (fr 
programmed cell death 5 
ieuoyi/cystinyi aminopeptidase 
signal transducer and activator of trans 
KIAA0979 protein 
peroxisomal farnesylated protein 
kynureninase (L-kynurenine hydrolase) 
GATA-binding protein 3 
dermatan sulphate proteoglycan 3 
aiy! hydrocarbon receptor 
bono morphogonctic protein 7 (osteogenic 
gb;ESTg4514 Activated T-cells I Homo sap 
conserved gerie amplified in osteosarcoma 



ESTs 
ESTs 

Iclransli 

, matrix associated, acli 
ESTs, Weakly similar to ALU5.HUMAN ALU S 
KiAA0013 gene product 
melhylenetelraliydrDfdIate dehydrogenase 
cleavage sUmulatbn facbr, 3' pre-i^A, 
collagen, type i, alpha 1 
peptidylprolyl isomerase F (cydophilin 



gb'.zu69f07.s1 Soares.lesti5_NHT Homo sap 
KIAA0033 protein 

Homo sapiens mRNA; cDNA DKFZp564N0763 (f 



minichnomosome maintenance deficient (S. 
gb:wf72b08.x2 Soares_NFL_T_GBC_S1 Homo s 
KIAA0129 gene product 
collagen, type X, alpha 1 (Schmid metaph 



upregulale stage 
upregulale stage 
upregulale stage 



C.39 upregulale stage 

a44 upregulale stage 

0.15 upregulale stage 

0.47 upregulale stage 

0.68 upregulale stage 

0.34 upregulale stage 

0.04 upregulale stage 

0.05 upregulale stage 

0.18 upregulale stage 

0.32 upregulale stage 

.0.4 upregulale stage 

0.29 upregulale stage 

0.16 upregulale stage 

0.04 upregulale stage 

0.28 upregulale stage 

0.36 upregulale stage 

0.05 upregulale stage 

0.47 upregulale stage 

0.11 upregulale stage 

0.05 upregulale stage 

0.21 upregulale stage 

0.06 upregulale stage 

0.09 upregulale stage 

0.14 upregulale stage 

0.08 upregulate stage 

a87 upregulate stage 

a09 upregulale stage 

ai9 upregulate stage 

0.72 upregulale Stage 

0.06 upregulate stage 

0.08 upregulate stage 

0.18 upregulate stage 

0.18 upregulate stage 

0.25 upregulate stage 

0.22 upregulate stage 

0.1 upregulate stage 

0.1 1 upregulate stage 

0.13 upregulate stage 

0.14 upregulate stage 

0.16 upregulate stage 

0.39 upregulate stage 

0.16 upregulate stage 

0.1 upregulate stage 

0.1 upregulale stage 

0.15 upregulale stage 

1 upregulale slage 

0.06 upregulale stage 

0.16 upregulale stage 

1 upregulate stage 

0.11 upregulate stage 

0.34 upregulate stage 

0.16 upregulate stage 
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427961 AW293155 

427986 N45214 

428003 AL11C200 

428004 M449563 



Hs.291866 
Hs.48401 
Hs.143134 
Hs.282387 
Hs.1 81384 
Hs.300270 
Hs.186375 
Hs,185798 
Hs.1 91602 
Hs.182339 
Hs.293317 
428192 AA424051 
428227 AA321649 Hs.2248 
428403 AI393048 Hs.239894 
428436 BE080180 
428450 NM-014791 Hs.18«39 
428479 Y00272 Hs.184572 
AW262022 



428576 



428716 AL122118 

428783 AW070204 

428788 AF08228a 

428829 R14050 

428839 AI767756 

428881 AI298368 

428954 AF100781 

428988 AA442900 

429042 AW015489 

429057 AF1 56557 

429066 AA868556 

429072 AI376228 

429083 Y09397 

429091 AA935658 

429115 AA446728 

429127 AA749382 

429136 AA446966 

429170 NM_001394 



H3.167621 
Hs.186758 
Hs.189095 
H3.189S27 
Hs.190614 
Hs.178176 
Hs.193516 
Hs.194051 
Hs.82302 



429174 



AA448407 



Hs.198481 
Hs.198891 
Hs.186180 



429359 W00482 

' 429376 AIB67889 

429412 NM_C06235 

429450 AA8244S1 

429472 AW452421 

429482 AF076974 

429486 AF155827 

429672 AW296375 

429584 AIB17785 

429590 AI219490 

429597 NM_003816 

429601 AIB04293 

429602 AA521463 



429631 AA455612 



429673 AA884407 



Hs.2407 

Hs.94292 

Hs.1 6662 

Hs.203952 

Hs.203g63 

Hs.39474 

Hs.1 83037 

Hs.44445 

Hs.2442 

Hs.119406 

Hs.1 83424 

Hs.211563 

Hs.30622 

Hs.136710 

Hs.166379 

Hs.211581 

Hs.211588 

Hs.211696 

Hs.15g300 

Hs.220689 

Hs.1 80791 

Hs.225767 

Hs.1 08241 



AA460608 

H80572 

AAS26911 

AW061608 

AF079550 



ESTs, Weakly similar to ALU8_HUMAN ALU S 



ESTs 
ESTs 

els homologous factor 

ESTs, Weakly similarto JM27 |H.sap!ensl 

gb.zi/80d03.s1Soares_total_fetUs.Nb2HF8_ 

small inducible cytokine sublamily B (Cy 

leucine rich repeal (In FUl) interactin 

gb;RC4-BT0629-1 20200-01 1-b10 BT0629 Homo 

K)AA0175gene product 

cell division cyde 2, G1 to S and G2 to 

Homo sapiens cDNA FU12839 lis, clone NT 

ESTs 

KIAA1441 protein 

similar to SALLI (sal (Dro5ophlla)-llke 
Homo sapiens clone 25016 mRI^ sequence 
Homo sapiens mRNA; cDNA [lKFZp43401221 (T 



Homo sapiens mRNA; cDNA DKFZp565B213 (fr 



Hs.98i 

Hs.27947 
Hs.235920 
Hs.194816 



Hs.107233 ESTs 



WNT1 inducible Signaling patliway protein 



ns cDNAFLJ14098ns,cl( 



gb:zw68d11.s1 Soares_test(s_NHT Homo sap 
RAR-relaled orphan receptor B 
serlne/threonine-proteln Idnase PRP4 horn 
Homo sapiens cOiA: FLJ23038 fis, cbns L 
inliibln,betaC 
matrix metalioprob 
ESTs 



protein kinase, cAMP-dependent, regulate 
ESTs, Weakly similar to Kelch motif coni 
a disintegrin and melaltoproleinase doma 
ESTs, Weakly similarto AF143946 1 trans 



melai-regulatory transcription factor 1 

POU domain, class 4, transcription facto 
protein tyrosine phosphatase, non-recept 
ESTs 

Ras-GTPase-aclivating protein SH3-domain 
ESTs 

IDN3 protein 

ESTs, Weakly similar to Tlie K1AA0191 gen 
protein tyrosine phf-" ' 



gb:yu76c02.r1 Soares letal liver spleen 



glial cells missing (Drosophila) homolog 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
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15 
20 
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430020 A1539C29 

AA463913 
430049 AW277086 
430060 NM_002941 
430076 AA465115 
430134 BE380149 
AB013802 
430195 AW969308 



Hs.99607 
Hs.227277 
Hs.99607 



430387 AW372884 

430486 BE062109 

430488 019589 

430512 AF1B2294 
AF1 29534 
AK00Q062 

430561 BE065227 

430563 AA481269 

430598 AK001754 

430630 AW269920 

430634 AI8606S1 

430637 BE180081 

430640 AA482636 

430665 BE350122 

430726 AL03)224 

430733 AW975920 

430781 AW088127 

430791 AA485293 

430817 AA487242 

430888 BE155293 

430918 NM.00Q843 

430926 L05597 

430994 AA490346 

431009 BE149762 

431023 AI283133 

431030 AA830525 

431041 AA490967 

431070 AW408164 

431082 AA491600 

431089 BE041395 

431146 Z83850 

431173 AW9711g8 



431322 AW970622 

431332 AA503297 

431343 AW970603 

431346 AA371059 

431347 AI133461 
431381 AA577114 
431448 AL137517 
431494 AA991355 
431510 AA5B0082 
431660 BE244136 
431571 AW5CC486 
431596 T34708 
431610 AK000972 
431613 AA018515 
43163C NM_002204 
431663 NM_016569 
431670 AW971287 
431689 AA305688 

431691 A1208511 

431692 AL021331 
431694 AW970112 
431726 NM.015361 
431736 AI912234 
431753 X76029 
431781 AA515474 



H5.11710B 
H5.300941 
Hg,261636 
H3.2S1664 
Hs.105727 
Hs,2883B1 
Hs,12980B 



gb2x97a10.r1 SoaresJJhHMPu_S1 Homosapi 
Honw sapiens cDNA FU13841 fis, done TH 
sine oculls homeobox (Drosophlla) homolo 
Homo sapiens cDNA FU13841 Js, clone TH 



roundabout (axon guidance receptor, Dros 
gb:aa32c11.r1 NCI_CGAP_GCB1 Homo sapiens 
ESTs, Weakly similar to contains similar 



ls.188594 ESTs 



Hs.1 25759 
Hs.238126 
HS.1C6747 



Hs.241578 
Hs.49210 
Hs.243756 

Hs.178381 

Hs.247112 

Hs.2621 

Hs.26685 

HS.2S6290 

Hs,157367 



ESTs, Weal<ly simiiar to tumor suppressor 
CGI-49 protein 

serine carboxypeplidase 1 precursor prol 
gb:PI»11-HTG527-28020O-0O5-a05 HT0527 Horn 
nuclear cap binding protein subunit 2, 2 
cliioride channel, calcium activated, fam 
ESTs, Moderately similar to tetracycline 
U6 snRNA-associaled Sm-like protein LSm8 
F-boxonly protein 4 
liypollieli'cal protein FLJ20055 
gb;RC1-BT0314-310300-015-b06 BT0314 Honn 



Hs.283361 ESTs 
Hs.278636 ESTs 



Human DI\1A sequence from clone 336H9 on c 



Hs.272068 ESTs, Moderately similar to altornab'vel 



ribosomal protein I27a 
gfutamale receptor, metabotropic 6 
5-hydroxytryplamlne (serotonin) receptor 



Hs.76064 
Hs,248131 
Hs.24ai36 
lls.40530 
HS.24S213 



gap junction protein, beta 6 (connexin 3 



Hs.29ig88 ESTs 



transcription factor 19 (SCI) 

ESTs 

ESTs, Wealdy similar to unl<nown protein 
Human DNA sequence from PAC 82J11 and co 



gb£ST382704 MAGE resequences, MAGK Homo 
ESTs 

Homo sapiens cDNA FU11661 fls, ctone HE 
ubiqultin specific protoase 3 
InsuHn-IIke growBi factor 2 (somatomedl 



hypotlietical protein FUIO1 10 
Apg12(aulophagy12,S.cerevlsli . 
integrin, alpha 3 (antigen CD49C, alpha 
TBX3-iso protein 
gb:EST383376 ^MGE resequences, MAGL Homo 
UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 
ESTs 



H5,292697 ESTs 

Hs.268053 KIAA002 

Hs.151245 ESTs 

Hs,2841 neuramei 
H5.99908 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 



1.61 0.42 upregi 



late stage 
late stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
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Hs.270845 


kinesin-like 5 (mitotic tdnesin-lil^e pro 














delta-tubulin 
























AA572994 




gb:nm3^1Zs1 I^CI CGAI^Up2 Homo sapiens 




























integrin, alptia 2 {CD49B, alplia 2 sutiuni 














Homo sapiens cDI^ FU20099 fis, done CO 
























X63629 
















Hs.291069 






































AW2731 28 




























AF220217 


Hs.272374 


Homo sapiens rsec15-lit(e protein mRt^ p 










AA401039 














AW975868 


Hs]29410C 












Hs. 2691 09 








upregulate stage 








gb:yl52c03.r1 Soares breast SNbHBst Homo 


























Hs.27 2798 


tiypotiietical protein F1J20413 
























AU076609 














A(204995 




gb:an03c03.x1 Slrat£gene sciifzo brain SI 












Hs.1 90297 












AK001926 




hypiMcal protein FU11064 
























AK001239 




hypotlietlcal protein FUll 0377 ^ 






upregulate stage 




AA534197 


Hs.272693 














Hs.301886 


Homo sapiens cDNA FU1 1 346 fis, done PL 






upregulate stage 








SI 00 calcium-binding proton P 










AA221036 


Hs.285026 


HERV-H LTI^assodating 1 


























Hs.289014 


























BE218886 


Hs. 282070 
















































































































ESTs, Wealdy similar to unnanied protein 










AL043482 


Hs. 2671 16 








upregulate stage 




















Hs.135567 














Hs.278627 
















gb;H.5apiens mRNA for ligase like pratei 














PRO1510 protein 










NM_014124 




PRO0255 protein 






































AK001403 


Hs.279521 


hypotfreticai protein FU20530 














hypotiietical protein F1J20080 










NI\(L014125 


Hs.279812 


PRO0327 protein 










AW270239 






























Hs.279905 


clone HQ0310 PRO0310p1 








































Hs.1 50587 










433201 


AB040896 


Hs.21104 


KIAA1463 protein 


9.2 


o!o9 


upregulate stage 


433211 


H11850 


Hs.1 2808 




1.6 • 




upregulate stage 


433218 


AI040372 


Hs.278894 


KIAA1482 protein 




0^44 


upregulate stage 


433222 


AW514472 


Hs.238415 


ESTs, Moderately similar to ALUS HUMANA 


6.45 


0.12 


upregulate stage 




AW136134 


Hs.220277 


ESTs 


7.3 


0.09 


upregulate stage 




AB040930 


Hs.297021 


Homo sapiens cDNA FU1321 1 fis, done NT 


1 




upregulate stage 




AF026944 


Hs.293797 


ESTs 


4.95 


0.08 


upregulate stage 


433371 


T2S451 




3b:PTHI188HTCDL1 Homo sapiens cONAfffl 


4.75 


0.12 


upregulate stage 


433394 


A1907753 


Hs.93810 


cerebral cavernous maifbmiatlons 1 






upregulate stage 


433424 


R682S2 


Hs.163566 


ESTs 






upregulate stage 


433440 


AF052127 




gb:Homo sapiens done 23850 mRNA sequenc 






upregulate stage 




AW296906 


Hs.142869 


ESTs 


9.82 


0.08 


upregulate stage 


433456 


AAS93447 


Hs.1 24296 


ESTs 


9.45 


0.08 


upregulate stage 


433467 


AI420457 


HS.5095S 


ESTs 


1.11 




upregulate stage 


433479 


AW511459 


Hs.249972 




3.35 


o!l3 


upregulate stage 
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4334B4 


BE264397 


Hs.1 48674 


ESTs 








433S15 
433602 


AA595800 
A1769948 


Hs'24906 


ESTs 
ESTs 


3.05 




upregulate stage 
upregulale stage 


433613 


M836126 


Hs.5669 






0.12 




433625 


AW955674 




ESTs 




0.53 


upregulate stage 


43365B 


L03678 


Hs!l56110 


immunoglobulin kappa constant 


9.65 




upregulale stage 


433672 


BE281165 




TLS-assodated serine-aginlne protein 1 




0.08 


upregulate stage 


433730 


AK002135 




hypothelical protein FU11273 
ESTs 


67 




upregulate stage 


433735 




Hs.109653 


8.95 




upregulate stage 




AI287912 




mifogen-activated protein Idnase kinase 






upregulate stage 




AI399956 


Hs.208956 




5.5 




upregulate stage 


433929 


AI375499 




ESTs 


7 


0.09 


upregulale stage 




AF113017 




PR01 268 protein 






upregulale stage 




API 1301 8 


Hs.284302 


PR01621 protein 






upregulale stage 




AF1 13688 




gb;Homo sapiens clone FLB4630 






upregulale stage 




AF1 16601 


Hs.283048 


tiypotheUcal protein PRO0128 




0.09 


upregulate stage 




AL04S04S 


Hs.180758 


tiypotheHcal protein PRO0082 


a35 


0.09 


upregulale stage 


434085 


AF116673 


Hs.250029 


hypothetical protein PR0192S 




1 


upregulate stage 




AA62515S 




gb:af70d06.r1 Soares_NhHMPu_S1 Homosapl 






upregulate stage 




AA30SS9S 


Hs.238205 


hypothetkial protein PRO2013 






upregulate stage 




AA625804 




gb2uB6h01.s1 SoaresJesUs_NHT Homo sap 


ass 


o]ii 


upregulate stage 




AW387314 






1.65 




upregulate stage 




AF1 19847 


Hs!283940 


Homo sapiens PRO15S0 mRNA, partial cds 


7.85 


o!o9 


upregulate stage 




AW014795 






3.8 


0.13 




434228 




Hs.283978 


Homo sapiens PR02751 mRNA. complete cds 






upregulate stage 




AA897778 
BE00539a 


Hs.201677 


gb:CM1-BN0116-150400-189-ti02 BN0116 Homo 


9.45 


o!38 
0.07 


upregulate stage 
upregulate stage 




All 25686 


Hs.152727 




Z66 




upregulate stage 




AW974991 


Hs.191852 


ESTs, Weakly similar to ALULHUMAN ALU S 




o!61 


upregulate stage 














upregulale stage 




AA12109a 


Hs.3838 




10.7 


0.08 






BE063921 
AB037829 




regulator of nonsense transcripts 2; DKF 


10.15 
7.6 


0.07 
0.09 


upregulale stage 


434434 


W79839 


Hs. 104336 








upregulale stage 


434513 


AF1 43888 


Hs.18213 


liimo salens clone lhlAGE:121736 mRNA seq 


]' 




upregulale stage 






Hs.41407 


ESTs 




0.18 


upregulale stage 


434640 


NM.016045 




TH1 drosophila homolog 


1Z4 


0.06 


upregulale stage 






Hs.58609 


ESTs 


1.75 


0.38 


upregulale stage 










9.7 


0.06 


upregulale stage 


434627 


A1221894 


Hs!39311 


ESTs 


1.65 


0.17 


upregulale stage 




AA789081 




glioma-amplified sequence'41 






upregulale stage 




AA641972 


Hs.130058 








upregulale stage 




AA648049 


Hs.121618 








upregulate stage 




AA831115 


Hs.190473 






o!71 


upregulate stage 




AA648962 






10.55 




upregulate stage 




AA649253 


Hs.1 32458 




5.45 


o]ii 


upregulate stage 








phoifaol-1 2-myristate^l 3-acetate-induced 




0.34 


upregulate stage 




AF1 60477 


Hs!245781 


Homo sapiens Ig superfamlly receptor Ui] 




0.57 


upregulate stage 


434909 


AI479212 
BE5432e9 


Hs]50252 


hypothetical protein FtJ10890 

Homo sapiens HSPC283 mRNA, partial cds 


4.9 


o!i3 


upregulate stage 
upregulate stage 




AF161422 


Hs.21590 


Homo salens HSPC304 mRNA, partial cds 






upregulate stage 


434963 


AW974957 


Hs.288719 


Homo sapiens cI3NA FU1 2142 Us, clone MA 


10.76 


0.07 


upregulate stage 




AW272262 










upregulate stage 




AW770553 
AW975155 


Hs.293640 
Hs.292163 








upregulale stage 




H91923 


Hs.1 10024 


NADHrubiquinone oxidoreduclase MLRQ siibu 






upregulate stage 


435030 


A1203316 


Hs.148655 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


2.95 


o" 


upregulale stage 




AA454985 




cadhen'n-Ske protein VR20 






upregulale stage 




A1651474 










upregulale stage 




AI831760 










upregulale stage 




AW975241 










upregulale stage 








Homo sapiens c[3NAFU11692 Us, clone HE 








435136 


R27299 


Hs.'l0172 


ESTs 


8.9 


0.07 


upregulale stage 




AA668879 


Hs.116649 


ESTs 


1.35 


0,25 


upregulate stage 


435162 


AI911044 


Hs.213893 


ESTs 






upregulate stage 


435166 


AI391470 


Hs.158618 


ESTs 


5.5 


0.12 


upregulate stage 


435212 


AW300100 


Hs.1 641 85 


ESTs 


1 




upregulate stage 


435237 


AI026836 


Hs.) 14689 


ESTs 


8.75 


0.1 


upregulale stage 


435255 


W87434 




ESTs, Moderately similar to ALU1.HUMAN A 


3.4 




upregulale stage 


435256 


AF1 93766 


Hsj3872 


cyiokins-like protein CI 7 


a2 


0.'l4 


upregulate stage 


435257 
435298 


AA677026 
AA677696 


Hs.191217 
Hs.189196 


ESTs 
ESTs 


4,5 


0.12 


upregulale stage 
upregulale stage 


435307 


W90610 


Hs.192003 


ESTs 


1 


0.87 


upregulate stage 


435347 


AW014873 


Hs.1 16963 


ESTs 


245 


0.14 


upregulate stage 
upregulate stage 


435382 


N54493 




gb7v40g0S.s1 Soaies fetal liver spleen 




056 


435408 


H07897 


Hs.4302 


ESTs 


0.84 


0.97 


upregulate stage 


435491 


T98543 


Hs.191900 


ESTs 




0.3 


upregulate stage 



wo U3/003906 



PCT/US02/21338 









ESTs 






upregulate stage 




AW305188 




























AI653240 


Hs.49823 












AA699633 














AW043836 


Hs.21 2460 










435809 


H90213 










upregulate stage 


435826 


AI554089 


Hs!l 17880 


ESTs 


6^55 


m 


upregulate stage 




AJ278120 














W03698 


Hs.83613 


ESTs, WeaklysiirtlarloALUI HUMAN ALUS 






upregulate stage 






Hs.188620 








upregulate stage 




Aid 5862 








o'o8 


upregulate stage 




AA7C3271 


Hs.269903 


ESTs, Moderately similar to ALU1_HUMAN A 






upregulate sfage 




AA806465 


Hs.121536 
















gb:yd95f05.s1 Scares fetal liver spleen 


9.15 




upregulate stage 




AM)21983 


Hs.271432 




0.23 


upregulate stage 


436115 


AW512033 


Hs.102004 


ESTs 


1,9 


0.21 


upregulate stage 






Hs.145080 








upregulate stage 




AI248193 


Hs.119860 




9,61 


0.08 


upregulate stage 






Hs. 159452 












AA705466 


H5.1 19900 








upregulate stage 




AW4S0381 
AA706315 


Hs. 192057 






oloi 


upregulate stage 




AW450963 


Hs. 11 9991 








upregulate stage 




AA732755 




























AK000767 
















Hs. 275736 












BE085236 














AI962796 












436363 


AA843926 
BE065178 


Hs:i 24434 


ESTs 

gb:RC1-BT0314-020200-012-h01 BT0314 Homo 


3.35 


067 


upregulate stage 




A1683487 




Homo sapiens CDNAFLJ11441 lis, clone HE 










AW274577 










upregulate stage 




AA716141 










upregulate stage 








hypothetical protein FLJ23499 
























AA326108 




ESTs, Wealcly similar to enhancer-of-spli 






upregulate stage 




AA721209 














AA766433 










upregulate stage 




AA721381 










upregulate stage 




AI091435 


Hs.i 34859 








upregulate stage 




AI690021 


Hs.201536 








upregulate stage 




AW976133 




gb£ST387239 MAGE resequences, MAGN Homo 




0.09 


upregulate stage 




AW976004 
AA745597 


Hs.291731 
Hs!291400 








upregulate stage 
upregulate stage 




AW74986S 


HS.29364S 








upregulate stage 




AA830173 


Hs.291918 




16 


0^27 






AA767346 


Hs.'291614 








upregulate stage 




AA766458 


Hs.122812 












BE328074 


Hs.148661 




5.0B 


0]l4 


upregulate stage 








PR01 91 4 protein 










AA742327 


Hs.292687 








upregulate stage 




AL035864 




ESTs, Highly similar to differentially e 










AA746563 




gb:iiy60e04s1 Na_CGAP_Pr18Homo sapiens 










AL049466 














R49202 


Hg 181694 










437181 


AI306615 


Hs'l 25343 


ESTs, Weakly similar to KIAA07S8 protein 


4.4 


0!05 


upregulate stage 








Homo sapiens cDNAFLJ12910fls, clone NT 










BE092336 




gb:IL2-BT0734-240400-O72-A12 BT0734 Homo 
























AI283085 


Hs. 290931 










437258 


AL041243 


Hs!l74104 


ESTs' ' 


9J2 


0'08 


upregulate stage 


437267 


AW51 1443 


Hs.258110 


ESTs 


425 


0.12 


upregulate stage 


437274 


AA747965 




gb;n)!79a10.s1 NCI CGAP Ew1 Homo sapiens 






upregulate stage 


437283 




Hs.160377 


ESTs 




0.61 


upregulate stage 


437311 






SWl/SNF related, matrix associated, adi 


3.1 




upregulate stage 


437324 


AL?62077 




gb:Homo sapiens mRNA; cDNA DKFZp761A219 




0.25 


upregulate stage 


437330 


AL353944 


Hs.50115 


Homo sapiens mRNA; cDNA DKFZp751 Jtl '2 (f 


7.95 


0.07 


upregulate stage 


437356 


BE622396 




Homo sapiens mRNA; cDNA DKFZp76201615 (f 






upregulate stage 


437438 


AL359620 


Hs.14217 


hypothetical protein DKFZp762P2111 


9.15 


0.09 


upregulate stage 


437471 
437553 


AL390169 
A1829936 


Hs.130497 


gb;Homo sapiens mRNA; cDNA DKFZp547D064 
ESTs, Wealdy similar to MAT8_HUMAN CHLOR 




1 

0.56 


upregulate stage 
upregulate stage 


437567 


AW627990 


Hs.288954 


Homo sapiens cDNA: FU21466 fis, clone C 






upregulate stage 


437575 


AW954355 


HS.36S29 


ESTs 


10.25 


0.06 




437717 


AA8(M765 


Hs.132853 


ESTs 




0.77 


upregulate stage 


437722 


AW292947 


Hs.1 22872 


ESTs 


9.75 


0.05 


upregulate stage 



120 
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20 
25 
30 



55 
60 
65 
70 



437799 R51083 

437852 BE001836 

437886 BE264111 

437887 AA811524 
437889 AAa30524 

437937 AI917222 

437938 AI960087 
437983 AI303023 

438032 BE045624 

438069 N80701 

438077 AA777330 

438081 H49546 

438102 AA777793 

438112 W86729 

438113 AI467908 
438119 AW963217 
438144 AA77B894 
438153 AI268632 
438171 AW9765Q7 
433271 L21934 
433290 AA843719 
438321 AA576635 
438325 AA804258 
438334 AA806992 
438366 AA805760 
438370 AA843242 
438374 AA321856 

438377 AA806070 

438378 AW970529 
438401 AL046321 
438403 AA806607 
438412 AA806776 
438448 AA807344 
438451 AI081972 
438473 H07986 
438487 AI684733 
438529 AW007287 
438534 AW204062 
438693 AA814360 
43B698 AW297855 
43B728 AA815202 
438746 AI885815 
43B80S AA826048 
438812 AA826199 
438817 AI023799 
438886 AA827728 
438913 AI380429 
438950 H23789 
438961 H42135 



Hs.90787 
Hs.256897 
Hs.31314 
Hs.29263 



gb:RC2-ST0165-300999-011-g02 ST0165 Homo 



439057 H59623 
439176 A1446444 
439179 AA831250 



Hs.298964 

HS.137S80 

Hs.194279 

Hs.8882 

Hs.203961 

Hs.118364 

HS.1461S9 

Hs29351S 

Hs.14553 

H3.122341 

Hs.6153 

Hs.123229 

Hs.291686 



439212 AF087995 

439223 AW238299 

439312 AA833902 

439330 AF086147 

439351 W37688 

439430 AF124250 

439444 A1277652 

439453 BE264974 

439476 AA836340 

439492 AF086310 

439527 AW298119 

439550 H10438 



Homo sapiens cDNA FL111896 fis. do 



gb:wq05c02.x1 NCI_CGAP_Kid12Hoinosapien 
hypolhelical protein FU22313 
splicing factor(CC1.3) 



Hs.197484 ESTs 
Hs.292206 ESTs 



ESTs, Moderately similar to AF1 16721 89 

ESTs 

ESTs 

ESTs 

sterol O-acyllransterase (acyl-Coenzyme 
ESTs 

CGl-48 protein 

ESTs 

ESTs 

gb:ns43f01,s1 NCl.CGAP.GCBI Homosaplens 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434C1717 (f 

ESTs 

0 sEfileiis cDNA: FU21816 Os, done H 



Hs.220261 ESTs 



Hs.28538 
Hs.1 23644 
Hs.249595 
Hs.125815 
Hs.25657 
Hs.184727 
Hs.117887 
Hs.44287 
Hs.163242 
Hs.128705 
Hs.172445 
Hs.144530 
Hs.101848 



ESTs 

Homo sapiens mRNA for partiel 3'UTR, seq 



ESTs, Weakly similar to AF149422 2 unkno 



gb:Homo sapiens full length InsertcDNA 



Hs, 190394 ESTs 



Hs.134877 
Hs,23945 
Hs,270745 

Hs,66158 
Hs,6564 
Hs.54578 



upregulale stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulatestage 
upregulale stage 
upregulatestage 
upregulatestage 
upregulatestage 
upregulatestage 
upregulatestage 
upregulatestage 
upregulatestage 
upregulate stage 
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30 
35 
40 





W79123 




ESTs, Weakly simil* to KI01_HUMAN PROBA 






upregulate stage 


439780 


AL3 59053 




Homo sapiens mRNA full length Insert cDN 
gb:Homo sapiens mRNA full lengtti Insert 






upregulate stage 
upregulate stage 




All 49520 










Upregulate stage 




AI571647 




hypottietical protein FLJ22969 






upregulate stage 


















AW993791 




ESTs 






upregulate stage 




AW247529 




platelet-^vaDng factor acetylhydrola 














tiypottieb'oat protein FU10430 






upregulate stage 




AA860116 


Hs. 223232 










439999 










0.07 


upregulate stage 








hypothetical protein FLJ20510 










AA861072 




gb:ak32e053l Soares testls_NHTIHo[nosap 










AA975145 
R43809 


Hs.22688 








upregulate stage 
Upregulate stage 




AF12S392 




insulin induced protein 2 








440249 




Hs]l25325 










AA912032 


HS.1810S9 










440334 


BE276112 












440348 


AW015802 














AF030933 




RAD1 (8. pomi}e) tiomolog 
























Hs.100554 








upregulate stage 




AV657117 










upfBguiate stage 


















AA904244 


Hs. J 53205 








upregulate stage 




AW993377 


Hs.1 30390 








upregulate stage 




AW449445 




diacylglycerol lOnase, alplia (SCltD) 










AA909651 












440980 


AL042005 




tripeptldyl peptidase II 








440994 










upregulate stage 


441092 












upregulate stage 




AA917075 


Hs.190520 














Hs.1 26632 








upregulate stage 




AI027604 














AW137827 










upregulate stage 








ESTs, We^ly simllarto TERA HUMAN |H.sa 






upregulate stage 




AA927170 










upregulate stage 




AI078234 










upregulate stage 


















AA931077 




Homo sapiens cDHA FU 12021 fis, done HE 






upregulate stage 






Hs. 126846 








upregulate stage 




AW294408 


Hs.222068 








upregulate stage 




AA356792 


Hs.301786 








upregulate stage 




BE503874 


Hs!301986 












AW274946 










upregulate stage 














upregulate stage 






Hs]l89133 












AW015203 














AW578981 


Hs.52184 


liypottiellcal protein FLJ20618 










AW473362 


Hs.127221 












AI802629 




Homo sapiens cDNA FLJ11631 fis, clone HE 










BE569122 




Rl^-t3inding protein (autoantigenic) 
























AW271702 










upregulate stage 




AA384673 




RPB5-niediati[^ protein 






upregulate stage 


















AW298818 


Hs.1 27341 








upregulate stage 






Hs 161377 










441762 


AW592203 


Hs!l44769 


ESTs 


l' 


m 


upr^ulate stage 




AW294909 


























441799 


AW292276 


Hs'.127872 


ESTs 


l' 


0.22 


upregulate stage 


441801 


AW242799 


Hs.211874 


ESTs 


8 


0.06 


upregulate stage 


441904 


AI633206 


Hs.128104 


ESTs 




0.19 


upregulate stage 


441955 


AA972327 


Hs.1 42903 


ESTs 


as? 


0.98 


upregulate stage 


441989 


AA306207 


Hs.286241 


Homo sapiens cDNA: FU22698 fe, otone H 


9.17 


0.07 


upregulate stage 


441990 


T66139 




ESTs 


3.55 


0.12 


upregulate stage 


442029 


AW956698 


Hs! 14456 


neural precursor cell expressed, develop 






upregulate stage 


442030 


W67167 


Hs.1 09080 






o!35 


upregulate stage 


442064 


AI422867 


Hs.88594 


ESTs 


aa 


0.08 


upregulate stage 


442071 


BE048433 


Hs.276043 




9.15 


0.09 


upregulate stage 


442093 


AA976049 


Hs.1 28484 


ESTs 






upregulate stage 


442194 
442202 


AA984389 
BE272862 


Hs.205088 
HS.106S34 


ESTs 

Homo sapiens cDNA: FU22625 Bs, clone H 


ao 


0.83 
0.08 


upregulate stage 
upregulate stage 


442203 


A1921423 


Hs.250146 


ESTs 


1 






442214 


AI681733 


Hs.1 29003 


ESTs 


22 


0.26 


upregubte stage 



wo U3/003906 



PCT/US02/21338 



20 
25 
30 
35 



442216 A1733468 

442295 AI827248 

442319 BE048144 

442432 BE093589 

442510 AF150179 

442513 AF150226 

442539 ALU 9606 

442552 R2Q624 

442562 BE379584 

442564 AI590207 



442597 AI499214 

442511 BE077155 

442612 AI006233 

442619 AA447492 

442S42 R51B53 

442660 AW138174 

442696 BES66962 

442712 BE465168 



442856 H56735 

442861 AA243837 

442875 BE623003 

442879 AF032922 

442883 AW195774 

442961 BE614474 

442966 A1394036 

442980 AA857025 

442992 AI914699 

442994 AI026718 

443054 AI746185 

443113 AI04D686 

443119 AA312264 

44317) BE28H28 

443211 AI12a388 

443242 BE243910 

443243 AI452496 
443247 BE614387 
443270 NM 004272 
443299 AI733642 
443362 AI0S3464 
443383 AI792453 
443411 AW134566 
443426 AF0981S8 
443447 AI094222 
443542 A1927065 
443557 AV645987 
443534 AI807036 
443606 AI078664 
443634 H73972 
443640 AI872643 
443715 AI583187 
443799 AA150320 
443899 AW842283 
443917 AW503739 
443919 AI091284 
443967 AW294013 
443977 AL120986 
443979 AV647366 
444020 Rg2962 



Hs,129006 ESTs 



Homo sapiens cDNA aJ1 1469 Us, done HE 
CGI-102 protein 

Homo sapiens cDNA: FLJ23468 fis, clone H 

ESTs 

gb:AF150226 Human mRNA from cd34+ stem c 
Homo sapiens oDNA; FU23006 fIs, done L 
son of sevenless (Drosophia) hoimlog 1 



ESTs. Weakly sMarto AF164793 1 prote 

ESTs 

ESTs 

Homo sapiens cDNA; FU20913 fIs, clone A 
ESTs 

ESTs, Weakly Glmliarlo KIAA1205 protein 



ls.57787 ESTs 



Homo sapiens cDNA FU13611 Us, dc 



Homo sapiens GDNAFU13986 Us, done Y7 
ESTs, Weakly similar to dual spedfldty 

kinesin-like 1 



es-assoclaled protein 65 kDa 



chromosome 20 open reading frame 1 



Hs.256212 ESTs 



444166 AV648429 

444270 AI138580 

444271 AW452569 
444282 AI138955 
444333 A1262567 
444342 NM_01439S 
444378 R41339 
444381 BE387335 
444431 AW513324 



gb:q(J79b07.x1 SoaresJeslls_NHT Homo sap 
trinucleotide repeat containing 15 
similar to lysosome-assodated membrane 
ESTs 

ESTs, Weakly similar to CA54_HUMAN COLLA 



upregulate stage 
upnegulale stage 
upregulate stage 



p eg I; 



ilate slags 
ilate stage 



upregulate stage 
upregulate stage 

upregulate stage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate stage 
upregulate slage 
upregulate stage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate slage 
upregulate stage 
upregulate stage 
upregulate slage 
upregulate slage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



123 
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444437 A1377961 

444444 AI149332 

444525 AW593778 

444584 AI168422 

444599 AI174377 

444646 AI184565 

444649 AW207523 

444675 AI186380 

444698 AI188139 

444743 AA045648 

444762 AI733700 

444781 NU.014400 

444783 AK001468 

444838 AV651680 

444849 A1199438 

444950 A1960256 

445027 AV652692 

445091 A1662164 

445098 AL050272 

445101 T75202 

446250 AI697838 

445258 AI635931 

446390 A1222165 

446396 BE181792 

445436 A1224105 

445444 AA380876 

445483 AI307150 

445496 AB007860 

445525 BE149866 

445527 W39694 

445537 AJ245671 

445546 AW468821 

445576 AI793233 

446623 AI245366 

445640 AW969626 

446668 A1657114 

445766 A1623607 

445770 ALII 9499 

446778 AA196443 

446787 A1263167 

445814 H92020 

446818 BE046321 

446829 AI452457 

445832 AI261545 

445873 AA250970 

446880 AV656474 

445883 AF070559 

446908 R13580 

446939 BE018658 

446019 AI362520 

446064 AB014537 

446062 AA211738 

446080 A1221741 

446082 AI274139 

446099 T93096 

446119 D29527 

446120 N25080 

446126 AW085909 

446127 AA333608 
446162 AI292036 
446196 AI744888 
446229 AI744964 
446248 AI283014 
446269 AW263155 
446303 X77244 
446312 BE087853 
446332 AK001635 
446356 AI816736 
446362 AW612481 
446398 AI681317 
446411 AI298828 
446474 AI301227 
446501 AI302616 
446507 AA352554 
446526 H89616 
446555 AV6S9046 



gb:ok30e11.x1 Soares_NSF.F8 9W_0T.PA^P_S 

ESTs 

gb;qd60b08.x1 SoaresJestis_NHT Homo sap 



nudix (nucleoside diphosphate United moi 



HS.1484B0 
Hs.224875 
Hs.282498 
Hs.147294 
Hs.12306 
Hs.12314 
Hs.175621 
Hs.147613 
Hs.144923 



plecksirin homology, Sf 



Hs.13413 
Hs.13436 

Hs,141003 
Hs,94133 



Hs,149158 ESTs 



Hs.145396 
Hs.101524 
Hs.136017 
Hs,145528 

Hs.251946 
Hs,131068 ESTs 



ESTs, Weakly similaf to KIAA0227 |H.sapi 

EST 

Homo sapiens oDNA FU13490 fis, clone PL 
neuronal potassium channel alpha subunit 
Homo sapiens cDNA FU13S5B fis, clone PL 
ESTs, Weakly similar to ALUC_HUMAN 1111 



Homo sapiens clone 24463 mRNA sequence 
Homo sapiens clone 24425 mRNA sequence 
Homo sapiens cDNA: FU21691 lis, clone C 



^s,255^14 ESTs 



Hs.150186 
HS.1S0819 
HS.151B4 
Hs.296290 
Hs,201847 ESTs 



gh:HUMNK667 Human epidennal kerallnooyte 



ubiquilously transcribed telralrlcopepti 



hypothetical protein FLJ10540 
malic enzyme 1 , NADP(+)-dependenl. cytos 
gb;QV1-BT0681-290400-181-h05 BT0681 Horn 
hypolhetfcal protein FU10773 
DHHC1 protein 



upr eg ulate stage 
upregulale stage 
upregulale stage 
upregulate stage 



upregulale stage 
upregulate stage 
upregulate stage 
upregulate stage 



9.B 0,08 upregulate stage 

3.05 0,13 upregulate stage 

10.6 0.06 upregulale stage 
1 0.29 upregulate stage 
4.65 0.12 upregulate stage 
1.35 0.22 upregulale stage 

10.7 0.07 upregulate stage 
1 1 upregulate stage 
12.05 0.06 upregulale stage 
10.65 0.06 upregulale stage 
4.85 0.1 upregulale stage 
9.7 0.06 upregulate stage 
4.2 0.13 upregulale stage 
1 0.31 upregulale stage 
1 1 upregulate stage 
4.7 0.13 upregulate stage 
1 0,34 upregulate stage 
1 0,23 upregulate stage 
1 1 upregulate stage 
9.7 0.07 upregulate stage 
3.1 0.13 upregulate stage 
1 1 upregulate stage 
1 1 upregulate stage 
1 0.37 upregulate stage 
3.21 0,22 upregulale stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
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35 



446677 


AB040933 


Hs.15420 


K1AA1500 protein 




0.61 


upregulale stage 


446629 


A1435046 


Hs.155148 


Homo sapiens cDNA: FLJ23082 fis, clone L 




0.25 


upregulale stage 


446636 
446682 


AC002563 
AW205632 


Hs.15767 


citron (rlio-interacling, serine/ttireonin 


2.54 


0.28 
0.18 


upregulale stage 
upregulale stage 


446701 


AK001621 


Hs.15921 


iiypollielical protein FU10759 


1.32 


0.69 


upregulale stage 




AV660019 


Hs.282676 








upregulale stage 




W39500 




ESTs, Weskly siirilar to LONN_HUMAN liflTOC 




0.07 


upregulale stage 




AI439136 


Hs.140546 




4!55 


0.12 


upregulale stage 




AV660348 


Hs.282688 






0.91 


upregulale stage 




AA128965 




TATA box binding protein (TBP)-assodate 




0.06 


upregulale stage 




W03766 






8.9 


0.09 


upregulale stage 




BE1 79030 










upregulale stage 




BE091926 






14!05 


0.05 


upregulale stage 














upregulale stage 


446880 


AI811807 


Hs.1 08646 


Homo sapiens cDHP> FU12534 fis, clons NT 






upregulale stage 






HS.2934S4 


gb;RC6-HT0580-10()50(M)22-H07 HTC680 Homo 
ESTs, We^ly simll v to Similarity to Ye 


2.75 


o!i6 

0.06 


upregulale stage 
upregulale stage 


446988 


AW339533 


Hs]272108 


ESTs 




0.29 


upreguiate stage 




AK001898 


HS.1S740 


liypotlietlcal protein FIJ11036 


1.75 


0.51 


upregulale stage 






HS.2B2735 








upreguiate stage 






Hs.1 68678 








upreguiate stage 


447069 


A1359927 


Hs!l57722 


ESTs 




a4 


upregulatestage 




AW885727 






44 


ai3 


upregulatestage 




AI41B781 


Hs,3DD144 








upregulatestage 




Y 13896 








o!l2 


upreguiate stage 




AB01 4599 




KIAA0699 protein 






upregulatestage 




AA805202 




eul<aryotlc translation inltlsHon factor 


6.2 


o!i2 


upregulatestage 






Hs.293545 








upregulatestage 




AI685286 


Hs.280386 




1.25 




upregulatestage 




BE617056 


Hs.283000 






0.16 


upregulale stage 




AW1 92200 










upregulatestage 




BE047911 
AW1 97097 




gb:tz44a05,y1 NCI_CGAP_Bm62 Homo sapien 
transcriptional intermediary factor 1 


1 


0.23 


upreguiate stage 
upregulale stage 








melanoma antigen, family A, 3 






upregulale stage 




AA515032 










upregulale stage 




AI199263 
AA256641 




ESTs, Highly similar to LRPl.HUMAN LOW-D 


5!95 


0.09 


upregulale stage 
upregulatestage 




AI376747 




gb:tc35ti05,x1 Soares_tolal_fetus_Nb2HF8_ 




0.33 


upregulatestage 




BE247676 










upregulatestage 




AI742989 






3!65 


o!l3 


upregulatestage 




AK000318 


Hs.18616 


hypothetical protein FLJ20311 


9.2 


0.08 


upregulatestage 




U46258 


Hs.23448 




14.4 


0.06 


upregulatestage 




BE143888 




gb:l«IR0-HT0165O81199-001-b04 HT0165 Homo 






upregulatestage 




AA912347 


Hs.1 36585 






a3 


upregulatestage 






Hs.1 70995 








upregulatestage 




N87079 


Hs.19236 


NADH dehydrogenase (ubiquinone) 1 beta s 




0.12 


upregulatestage 




BE619626 


Hs.266527 


ESTs, Weakly similar to unnamed protein 




0.46 


upregulatestage 






Hs.167253 








upregulatestage 






Hs.161338 




3.9 


aii 


upregulatestage 








protein tyrosine phosphatase, receptor t 


1.44 


a59 


upregulatestage 




BE620886 


Hs!23037 




12.15 


ao6 


upregulatestage 




AI452973 


Hs.l 65900 


ESTs, Wealdy simitar to ALUC_HUIi4AI>l llli 




0.08 


upregulatestage 




AI457097 


Hs!255905 








upregulatestage 




AI457098 


Hs.280848 








upregulatestage 












0.13 


upregulale stage 




AW611770 


Hs.246868 








upregulale stage 




AJ 297436 














A1458998 












AW295923 


Hs.255472 












AW847925 
















Hs.203230 












NM 005591 


Hs.202379 








upregulatestage 


448168 


AWe05999 


Hs.'22549 


hypctiieScal protein FU12799 


{ 


o!77 


upregulatestage 


448236 


AA890449 


Hs.20766 


oxyslerol 7alpha4iydroxylase 






upregulatestage 


448256 


BE614149 


Hs.20814 


CGI-27 protein 


11.95 


0.07 


upregulatestage 


448289 


AW390251 


Hs.202402 


ESTs 


1 




upregulale stage 


448366 


AL1 20837 


Hs.20993 






o!o7 


upregulale stage 


448357 


N20169 


Hs.108923 


ESTs 


1.34 


0.61 


upregulatestage 


448408 


AA322866 


Hs.21107 


neuroiigin 


1.7 


0.24 


upregulatestage 


448466 


AI252625 


Hs.269860 




8.8 


ao9 


upregulatestage 


448459 


AW069838 


Hs.171055 


ESTs 




0.27 


upregulatestage 


448464 




Hs.196093 


ESTs 


10.35 


0.06 


upregulatestage 


448468 


BE550361 


Hs.171072 


ESTs 






upregulatestage 


448502 


AW605286 


Hs.239699 


ESTs 


9.3 


0.08 


upregulatestage 


448552 


AW973653 


Hs.20104 


hypothetical protein FUG0052 


4.75 


0.13 


upregulatestage 


448566 


AW885606 


Hs.6064 




9.8 


0.08 


upregulatestage 


448569 


BE3826S7 


HS.2148S 


signal transducer and activator of trans 


2.14 


0.36 


upregulatestage 
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446643 A1557531 

446649 T94590 

446663 BE614S99 

446660 AW245890 

446726 AA193251 

446729 BE614536 

446743 AB032962 

448826 AI580252 

448914 A1927656 

448946 AI652855 

448968 AB020651 

448974 AL049390 

448979 AI611378 

449008 AW678003 

449032 AA045573 

449053 A1625777 

449067 AB037784 

449146 AW636677 

449203 AI634576 

449207 AU)44222 

449219 AI637581 

449230 BE613346 

449246 AW411209 

449318 AW236021 

449328 AI962493 

449343 Al1514ia 

449344 A1640355 



449434 AW2948S8 

449437 A1702038 

449474 AA019344 

449523 NM.Q00579 

449628 H63337 
449565 A1824925 
449568 AL1 57479 
449618 AI076459 
449639 AA001968 
449666 AA002047 
449704 AK000733 
449722 BE2B0074 
449764 N93104 
449764 AW161319 

449629 N51440 
449643 RB5337 
449892 N73608 
449894 AK001578 
449919 A1674685 
450020 AI680684 
450033 R43010 
450063 AI681509 
450083 AA131795 
450116 AA005355 
450121 AL040174 
450135 AI810816 
450144 T63961 
450149 AW969781 

450151 AIC88195 

450152 AI138635 
450195 AA007352 
450221 AA328102 
450238 T89693 
450257 AW82C313 

450313 AI038989 

450314 AA574309 
450350 T97817 
450411 D61167 

450448 D54299 

450449 AI696696 
450506 NM.004460 
450673 AW964334 
450628 AW382884 
450636 AI703076 



Hs.293246 
Hs.196459 
Hs.155796 
Hs.22663 
Hs.22689 



Hs.22900 
Hs.270344 
Hs.22941 
Hs.267564 
Hs.262121 
Hs.23255 
Hs.195012 
Hs.23348 
Hs.23363 
Hs.108788 
Hs.197647 
Hs.272458 

Hs.200760 
Hs.23495 
Hs,197030 
Hs.195684 
Hs.1 97641 



H.sapiens gene from PAC 42616, sii 



ESTs, Weakly similar to ALUS.HUMAN ALU S 
K1AA1136 protein 

ESTs, Weakly similar to putalive pi 50 [H 



Homo sapiens mRhJA; cDNA DKFZp66601318 (f 



Hs.10 



Hs.59956 

Hs.23900 
Hs.23960 
Hs.54695 
Hs.12915 
Hs.47261 
Hs.24030 
Hs.50309 
Hs.24129 
Hs.200141 
Hs,282219 
Hs.269452 
Hs.277133 
Hs.142001 
Hs.222882 
Hs.2a8927 
Hs.201142 
Hs.301851 
Hs.293440 
Hs,295233 
Hs.22968 
Hs.256042 



S-phase 
liypotlietical protein FLJ10983 
ESTs, Weakly similar- ' " 



tiypotiietical protein FLJ11252 



Homo sapiens cDNA: FLJ22902 lis, clone K 
ubiqultin-aotivatirg enzyme El (AISSTan 
chemokine (C-C motil) receptors 
Homo sapiens cDI>JA aJ23466 fls, clone H 
ESTs 

KIAA1598 protein 

Homo sapiens cDNA FLJ128191IS, ctone NT 
ESTs, Higiily similar io »/IGR7_HUMAN METAB 
gb:zh84eC5.r1 SoaresJetaLIIverjspleen_ 
GTPase aolivating protein 
cyclinBI 

ESTs, Weakly similar to ZNF91L [H.saplen 



sdute carrier family 31 (copper transpo 

ESTs 

tiypotiieticai protein FLJ1C716 



ESTs, Weakly similar to JH0148 m 



m s^ens cDNA: FU22944 fls, ctone K 



ESTs, Moderately similar to ZIC2 protein 



Hs.138777 ESTs 



Hs.24809 
Hs.283402 
Hs.1 74880 
Hs.202156 
Hs.25010 
Hs. 36244 
Hs.202068 
Hs.418 



cytoskeleton associated protein 2 



tiypotiietical protein FU 10826 



upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulatestage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulatestage 
upregulatestage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulatestage 
upregulale slage 
upregulate stage 
upregulatestage 



upregulate 



upregulatestage 
upregulatestage 
upregulatestage 
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450655 A1707846 

450664 AA808358 

460680 AF131784 

450722 A1732318 

450751 AI733251 

460772 BE326391 

450800 BE395161 
450824 



Hs.279860 
Hs.36830 
Hs.26318 
Hs.101120 
Hs.125353 
Hs.280146 



AW970602 

311471 



450870 , 

460937 R49131 

460983 AA306384 

451052 AA281504 

461067 BE172186 

461088 AA015600 

451094 AI949825 

461096 BE383234 

461126 H30600 

451161 AA211329 

461166 T98171 

451222 AA018386 

451225 AI433694 

461228 AI767166 

451246 AW189232 

451265 AA016292 

451276 AW2943B6 

451277 AK001123 
451291 R39288 
461326 AW296946 
461347 AI288579 
461359 H85334 



461580 AW138195 

451592 AI805416 

461651 A1097337 

451658 AW195351 

461684 AF216751 

451690 AW451469 

461724 AI903765 

461743 AW074266 

461794 AA019799 

451844 T61430 

461903 W19617 

451914 AIB22115 

451938 A1354355 

451939 U804S6 
461971 AA021185 
451998 AW594129 
452028 AK0018S9 
452036 NM_003966 
452099 BE612992 
462122 AF216833 
452163 AI863140 
452179 HI 6725 
462198 A1097560 
452206 AW340281 
452234 AW084176 
462240 AI691147 
452247 AL137432 
462250 BE618654 
452256 AK000933 
452266 AI767250 
462277 AL049013 
452281 T93600 
452291 AF015592 
452328 AA805679 
452331 AA598S09 
452345 AA293279 
462367 U71207 



Hs.269211 
Hs.107708 
Hs.1 84326 
Hs.213897 
Hs.88977 
Hs.250520 



Hs.28 



HS.2B853 
HS.61271 
Hsi9117 
Hs^9173 
Hs.29279 



hypolhelical proleln FU20030 
ESTs 

Homo sapiens clone 25194 mRNA sequence 
ESTs 

ESTs, Weakly similar to JU0033 hypotheH 
ESTs, Weakly similar to JU0033 hypolheS 
ESTs, Weakly similar to ALU5_HUUAN ALU S 



Hs.180789 
Hs.82415 
Hs.260395 
Hs.25925 
Hs.40910 
Hs.25006 
Hs.1 85675 
Hs.64341 
Hs.293608 
Hs.207025 
Hs.39140 
Hs.290849 
Hs.236533 
Hs.26176 
Hs.6702 
HSJ00967 



Homo sapiens done 23860 mRNA sequence 

ESTs 

hypotheUcaipiDtelnFLJIOSSg 



us T-cell lymphoma tumor antigen 



Hs.261003 
Hs.270618 
Hs.16697 
Hs.27311 
Hs.226306 
HS213666 
HS.27S95 
Hs.27621 
Hs.27931 
Hs.1710 

Hs.27463 
Hs.61210 
Hs.33074 
Hs.223296 
Hs.61232 



gb:ys90s05,r1 Soares retina N2bSHR Homo 
gb:op20liiayS Na_CGAP_Co12 Homo sapiens 
spastic paraplegia 4 (autosomal dominant 
ESTs 

gb:ze51g02.r1 Soares retina N2b4HR Homo 
ESTs, Higlily similar to CIK1_HUMAN VOLTA 
gb;yl07a01.r1 Soares retina N2b6HRHomo 
gb:EST3826S8 MAGE resequences, k/IAGK Homo 
ESTs, WealtlyslmOarto B34087 liypotlieti 
ESTs 

CDC10 (cell division cycle 10, S, cerevi 
ESTs 

hypothetical proieindJ511E16.2 



gb:UI-BT037-301 298-102 BT037 Homo saplen 

ESTs 

ESTs 

gb:yc06^3.s1 Stratagenelung (937210) H 
ESTs, IMerately similar to B34087 hypot 
ESTs, Weakly similar to KIAA0822 proleln 
down-regulator of transcription 1, TBP-b 
single-minded (Dnisophlla) homolog 2 



hypothetical protein FU10607 similar to 
ATP-bindIng cassette, sub-family B (MDR/ 
gb:tz43h12.x1 NCI_OGAP_Brn62 Homo saplen 



ESTs, Moderately similar to ALU1_HUMAN A 



hypothetical protein DKFZp761E1824 

hypothetical protein A-211C6.1 

Homo sapiens oDNA FIJ10071 fis, ctane HE 

ESTs 

K1AA1223 protein 

Homo sapiens cDOIA FU1 1041 fis, clone PL 
CDC7 (cell dMslon cycle 7. S. cetevisi 
ESTs 

H^apiens mRNA for pur alpha extended 3' 
hypothetical proleln FU20S15 
eyes absent (Drosophlla) homotag 2 



upregulate stage 
upreguiate stage 
upregulate stage 
upreguiate stage 



8.9 0.09 upreguiate stage 



upreguiate stage 
upreguiate stage 
upreguiate stage 
upreguiate stage 



upregulate stage 
upreguiate stage 
upreguiate stage 



upregulate stage 
upreguiate stage 
upreguiate stage 
upreguiate stage 
upregulate stage 
upregulate stage 
upreguiate stage 
upregulate stage 
upreguiate stage 
upregulate stage 
upreguiate stage 
upreguiate stage 
upregulate stage 
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452401 NM_007115 

452404 AW460675 Hs,212709 

452430 AF118083 H5,29494 

452436 BE077546 Hs.31447 

452457 AW0624gg 

452461 N78223 Hs,108106 

452518 AA280722 Hs,24758 

452519 BE006701 

452524 AW136499 Hs,29796 

452531 AA429462 Hs.293946 

452547 AA335295 Hs,74120 



452571 W31518 

452607 A1160029 

452677 BE167202 

452680 AW138410 

462724 R84810 

452738 AL133800 

462741 BE392914 

462747 BE153855 

452761 BE244742 



462846 AA082160 



462956 AW0O3578 

462974 BE09O803 

463011 N62952 

463050 AW1 36479 

463074 AA031813 

463076 A1978683 

463123 AI953718 

463134 AA032211 

463135 T07866 
463137 AI954733 
463144 AW268807 
453153 N53893 
463156 BE463762 
463204 R10799 
453228 AW628325 
463274 AA018511 
453293 AA382267 
453321 AI984381 



453775 NM-002916 

453776 R16749 
463846 AL157586 
453684 AA355925 

463900 AW003582 

463913 AW004683 

453925 AW021088 

453931 AL121278 

463945 NM_005171 

454032 W31790 

454049 AW022885 

464069 AW025160 

454099 AW062974 

454111 AW081681 

464219 X76042 

454259 AL110136 

454327 BE064097 

464331 AW372937 

454380 AWB53722 



Hs.34665 
Hs.61438 
Hs.212065 
Hs.45051 
Hs.30464 



H3.13S14S 
HS.20429S 
Hs.22481 
HS.2881S8 
Hs.8687 



Homo sapiens oDNA: FLJ23Sgi lis, do 
ESTs 

Human IMSCL-1 mRIM sequence 



Hs.1 58006 hypolhetlcal protein 

Hs.4213 ESTs 

Hs.231872 ESTs 

Hs.61506 ESTs 



Hs.224046 
Hs.271880 
Hs.232161 
Hs.221849 



Homo sapfens done 26129 mF?NA sec 



Hs.61646 
Hs.24360 
Hs.223784 

Hs!232327 
Hs.32769 
Hs.10653 
HS.232S21 
Hs.17996 
Hs.32963 
Hs.79981 
Hs.89474 
Hs.257789 
Hs.24633 
Hs.159397 
Hs.300284 
Hs.36120 
Hs.31677 

Hs,36232 
Hs.226414 
Hs.233502 
Hs.181614 
Hs.25144 



Homo sapiens mRNA M lengtli Insert cDN 



cadherin 6, type 2, K-cadlierIn (fetal kl 
Human done 2356C mPUfy sequence 
ADP-^itnsyiatlon factor 6 



onfactorC{aotivator114{37 

gb:DKFZp761H0216_n 761 (synonym: Iiamy2) 

KIAA0186 gene product 

ESTs , Wealdy similar to ALU6_HUMAN ALU S 



activating transcription factor 1 
ESTs 

gb:df45e05.y1 ^A)rton Fetal Codilea Homo 
ESTs, ivloderately similar to AHJ1_HUMAN A 
gb:IL1-ST0041-020899a)1-H08ST004i Homo 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate slage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulale stage 
upregulate stage 



upregulale slage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulate slage 
upregulate stage 
upregulale slage 
upregulate stage 
upregulate stage 



Homo sapiens mRNA; cDNA DKFZp564l112 (Ir 
gb:QV3-BT0297-231 199^)20-h08 BT0297 Homo 
gb:QV3-BT0381-1612994)42-aG9 BT0381 Homo 
gb:RC3-CT0347-2B119W)11-c04CT0347 Homo 



128 



wo U3/003906 



PCT/US02/21338 



454592 AW810112 

454648 AW811960 

464687 AW814473 

454692 AW813350 

454702 BE146915 

454729 AW817003 

454789 BE1563U 

464797 BE161153 

454863 AW835610 

454893 AW837753 

454951 AW847464 

454956 AW847725 

455047 AW862630 

455128 AW861555 

455201 AW947884 

455207 AW994394 

455331 AW897292 

455361 AW901942 

455380 BE160188 

455414 AW936969 

455428 AW938204 

455573 BE004988 

455586 BEQ70794 

455595 BEQ08343 

455610 BE011703 

455647 BE064415 

455550 BE064655 



455761 BE080895 

455799 BE169911 

455831 BE144966 

455874 BE152283 



455950 BE161004 

456951 BE161001 

456966 BE167014 

455981 BE1770D0 

456034 AW460979 

456046 RS1494 

456122 R11813 

456212 N51636 

456265 AI966210 

456285 R675B5 

456320 A1734064 

456353 AI042330 

456486 AA676544 

456493 AA261830 

456504 AKC00532 

456508 AA502764 

456519 AA279917 

455536 AW1 35986 

456592 R91600 

456621 T35958 

456582 AW500321 

455726 H43102 

45673S AW248217 

456785 AKC02084 

455800 ALII 8754 

456823 AL161979 

456844 A1264155 

456999 AA319798 

457015 AA688058 

457030 AI301740 

457158 AA135370 

457190 AI753247 

457309 AF131843 

457376 AI026984 

457402 AW452648 

457435 AW972024 

457437 AW969732 

457465 AW301344 

457467 AW974815 



Hs.71818 ESTs 



gb:RC2-CT0304-080100-011-b12 CT0304 Homo 
gb:MR4-ST01 24-100400-006-607 ST0124 Homo 
gb;RC2-ST01 68-240300-017-109 ST0168 Homo 
gb;MR3-ST0203-010200-1CS-c1 1 ST0203 Homo 
gb:MR3-ST0192-100100-024-g07 ST01S2 Homo 
gb;MRC-HT0208-221299-204-h0B HT0208 Homo 
gb:QVO-ST0247-040100-081-f03 ST0247 Homo 
gb:QVO-HT0367-150200-114.d02 HT0367 Homo 
gb:PM0-HT0425-170100-0G2-a10 HT0425 Homo 
gb:QV4-LT0016-090200-100-c02 ^70016 Homo 
gb:CM1-LT0042-310100-112-g03 LT0042 Homo 
gb:QV2-LT0051-240300-097-e12 LT0051 Homo 
gb;RC3-CT0208-270999-Q21-li12 CT0208 Homo 
gb:IL3-CT0213-18O20O-041-H10 CT0213 Homo 
gb:PM1-CT0243-071099-001-g06 CT0243 Homo 
gb:RC2-CT0321-110100-013-b05 CT0321 Homo 
gb:PM1-MTO010-20030O-001^O8 MT0010 Homo 
gb:RC3-BN00364J60400-014-h1 2 BN0036 Homo 
gb:CM0-NN0O57-15040O-338-bO2 NN0057 Homo 
gb:QVO-NN1022-10C400-190-b04 NN1022 Homo 
gb:QV1-HT0413-01O20O-069-g05 HT0413 Homo 
gb:RC1-DT0029-1602004l13-f10 DT0029Hoto 
gb:Q\A)-DT0048-170200-124-f01 DT0048Homo 
gb:MR2-BNO114-10CSOO-02O-bO4 BN0114 Homo 
gb:RC3-BT0501-130100-011-h02 BT0501 Homo 
gb:CMO-BN01544)80400J26-g10 BN0154 Homo 
gb:CM3-BN0223-100600-177-h09 BN0223 Homo 
gb:RC4-BT0311-241199-012-b03 BT0311 Homo 
gb:RC1-BT0313-301299-012-c09 BT0313 Homo 
gb:RC1-BT0314-310300-015-b12BT0314Homo 
gb:RC2-BT0318-241199-Q11-g02 BT0318 Homo 
gb;RC3-BT0319-120200-014-d09 BTC319 Homo 
gb;QV1-BT0631-280200-C84-e01 BT0631 Homo 
Homo sapiens cDNA FLJ22530 fis, clone H 
gb;RC6-HT01 87-201099-031-C04 HT0187 Homo 
gb:QV4-HT0316-191199-C39-b01 HT0316Homo 
gb:PM1-HT0350-231299-005-g05 HT0350 Homo 
gb:MRO-HT0407-140200-009-e06 HT0407 Homo 
gb:PM0-HTO425-17010O-002-h03 HT0425 Homo 
gb:PM0-HT0425-170100-002-f10 HT0425 Homo 
gb;CM2-HT0502-140200-088-d08 HT0602 Homo 
gb:RC4-HT0587-070400-015-b07 HT0587 Homo 
gb:UI-H-BI3-ala-a-12-0-Ul.s1 NCI_CGAP_Su 



Hs.173623 
Hs.268748 
Hs.136212 

Hs,87128 
Hs.171545 

Hs,98491 



Hs.261544 
Hs,173381 
Hs.188535 
Hs,29643 
Hs,239340 



ESTs, Weakly similar to similar to YBS4 
HIV-1 Rev binding protein 
gb:zs17g09.r1 NCLCGAP_GCB1 Homosaplens 
Homo sapiens cDNA FU20625 fis, clone KA 
ESTs, Wealdy similar to AF208855 1 BM-01 
ESTs, Weakly similar to Uriknown (H.sapie 
ESTs 



ires fetal livet spleen 
'IAFLJ12360lis,clonel 



ESTs 

achaete-scute complex (Drosophila) honnol 
hypothetical protein FU 11222 
gb-.DKFZp761P1910j1 761 {synonym: hanny2) 
Homo sapiens mRNA; cDNA DKFZp751G1823 (f 
CDP-diacylglycerol synthase {phosphaMa 
eukaryotlc translation elongalion factor 

dihydropyrimidinase-like 2 
Homosaplens cDNA FLJ21635ris,cbneC 
Homo sapiens cDNA FU13103 lis, clone NT 
Homo sapiens clone 24987 mRNA sequence 



wo U3/003906 



PCT/US02/21338 



10 
15 
20 



45 
50 
55 



457474 


AW972935 




gb:EST385a31 MAGE rssequences, MAGM Honro 
ESTs 


1 


0.29 




AW973713 


Hs.293595 




0.39 


457637 


A1288373 


Hs.149875 


ESTs 


1 






AI375499 












AA649162 


Hs.236456 
Hs.128596 




8.9 


o!o8 
0.9 




AA744046 


Hs.133350 








457857 


AW814892 


Hs.273104 


ESTs 

gb:ny51e10.s1 NCLCGAP_Pr18 Homo sapiens 


1 


0.06 




AI624876 




MAD (mothers against decapentaplegic, Dr 








AA765625 


Hs. 155690 




3.55 






A1498540 












NM 016353 












AW976942 








o!87 


458079 




HsJ5195 
Hs.54277 


ESTs, Highly similar to I(3;5)(q25.1;p34 
ESTs 


3.45 
11.5 


0.12 
O.06 


458158 


AW29677B 


HS.3Q0357 


ESTs, Highly similar b dJ416F21 .2 [H.sa 








AI42001S 


HS.192Q90 










BE007237 




gb:PMO-BN0139-OS0500-003^09 BN0139 Homo 




0.16 






Hs.153205 




4.6 


0,15 




BE299S88 




Homo sapiens cDNA; RJ21869 lis, clone H 




0.16 


458282 


M984075 


Hs]l13631 
Hs.22580 


ESTs 

alkylglycemne phosphate synthase 




0.67 




AA937666 






5.05 


ai3 


458580 


Z30118 


Hs.'293788 


ESTs, Moderately similar to unnamed prot 




0.28 


458586 


AI683479 


Hs.65390 


ESTs 




0.07 


458608 


AW444662 


Hs.202247 


ESTs 




a27 


458632 


AI744445 


Hs.24650 


Homo sapiens cDNA FLJ1 3047 fis, clone NT 


1.05 


0.23 


458663 


AV558444 




Homo sapiens cDNA FLJ13684 fls, clone PL 


5.05 


0.13 


458670 
458680 


AI301987 
N73773 


Hs!233398 
Hs.124740 


ESTs 
ESTs 
ESTs 


8.9 


0.08 
0.23 
0.3 


458722 


AA741545 


Hs.282832 


ESTs 






458747 






ESTs, Weakly similar to GTPase-aclivatin 




0J4 


458760 


AI498631 




ferritin, light polypeptide 




0.07 


458781 


AI444821 




gb:IRET4B7 subtracted retina cDNA libraiy 


6.05 


0.12 


458801 
458880 


AA046742 


Hs.276860 


ESTs 

gb:zf48c09/1 Scares retina N2b4HR Homo 


4.45 


0.13 


458886 
458946 
469023 
459028 


AA009716 
AWg68226 
Aig40577 


Hs.103277 
Hs.42311 
Hs.60798 


ESTs 
ESTs 
ESTs 

gb:IL5-HT0009-120799-001-G07 HT0009 Homo 


2.95 


0^3 
0.08 

0.17 



ESTs, Weakly similar to hypothettosl pro 



459128 AI902169 

459182 BE178517 

469204 AW194601 

459256 AW967468 

459319 NM.Oa)059 



Hs.281935 ESTs 



459464 AA854847 



401519 
402474 
402727 



407242 Ml 8728 

407347 AA829847 

407796 AA1 95509 

408243 Y00787 

408380 API 23050 



409417 AA156247 

410315 A1638871 

410324 AW2g2539 

412420 AL035668 

412490 AW803564 

413281 AAB61271 

414004 AA737033 

414161 AA136106 

414217 AI309298 



Hs.167347 
Hs.272239 
Hs.624 



gb:zx66h11.r1 Soares_totalJetus_Nb2HF8_ 
gb:ai77h02.s1 Soares.parathyroid_lumor_N 
gb:DKFZp761E2410_rl 761 (synonym hamy2) 
gb:hn49d07.x1 NCLCGAP_Co17 Homo sapiens 



gb:Homo sapiens (clone WR4.12VL) antl-th 
gbiHuman nonspecific crossreacting anOg 
ESTs, Moderdely similar to unknown [H.s 
gb:Human nonspec'ific crossreacting antig 
ESTs, Weakly similar to ALU8.HUMAN ALU S 
lymphocyte act 



diubiquitin 



Hs.17625 

Hs!73853 

Hs.288860 

Hs.34396 

Hs.7155 

HS.184B52 

Hs,279B98 



ESTs, weakly similar to 2115357A TYKl pr 
KIAA15S3 protein 

Homo sapiens cDNA: FU2316S Us, done L 



upregulale stage 
upregulale stage 
upregulate stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 



12.85 0.06 

.55 0.03 

.25 0.05 

.95 0.05 



16.45 0.03 

12.95 0.04 

15.25 0.04 

13.25 0.06 



wo U3/003906 



PCT/US02/21338 



5 

10 
15 
20 
25 
30 



50 
55 
60 
65 



414219 W20010 

414493 AL133921 

414522 AW518944 

414602 AW630088 

414761 AU077228 

416114 AI695549 

416179 R19015 

416391 AIB78927 

416815 U41514 

416980 AA381133 

417258 N58a85 



418791 AA935633 

419146 N99638 

421676 AA2996S2 

422150 AI867118 

422363 T55979 

424673 AA345051 

424848 AI263231 

424865 AF011333 

425053 AFW6024 

425787 AA353867 

426252 BE175980 

426329 AL389951 

427127 AW802282 

427351 AW402593 

427979 8E379776 

428044 AW93322 

428428 AL037544 

428840 M15990 

430191 AI149880 

430589 AJ002744 

430853 AI734179 

431049 AA846576 

431211 M86849 

431341 AA307211 

431639 AKOD0680 

431770 BE221880 

431863 AA188185 



Hs.76325 
Hs.76560 
Hs.77256 
Hs.183868 
Hs.79067 



437192 AW975786 

438000 AI82S880 

439941 AI392640 

440086 HU 005402 

440116 AI798851 

441020 W79283 

441633 AW958544 

441980 AK001441 

442043 BE567620 

442053 R35343 

442271 AF000662 

443303 U87319 

445033 AV652402 

446619 AU075643 

446847 T51454 

446921 AB012113 

448712 W01046 

448772 AW390822 

448926 AI798164 

449962 AA004879 

450139 AK001838 

451035 AU075785 

451334 AI1226gi 

452567 067120 



ALU-fused gene from chromosome 1q 

retinoblastoma-binding protein 2 
Homo sapiens cDI^IA: FLJ23125 fis, clone L 
Homo sapiens mRNA; cDNA DKFZp564B1 264 (f 
3r of zesle (Drosophila) homolog 2 



Hs.7055 

Hs.82643 

H6.282973 

Hs.173030 

Hs.194628 

Hs.1 11496 

Hs.2953 

Hs.1 16474 

Hs.294092 

Hs.145607 

Hs.153563 

Hs.1 54320 

Hs.155029 

Hs.28917 

Hs.271623 

Hs.22265 

Hs.123253 

Hs.181309 

Hs.1 82225 

Hs.184298 



ls.24639 
ls.140903 
ls.187820 
is.296323 



MAD (mothers against decapentaplegio, Dr 
mesoderm specific transcript (mouse) horn 
UDP-N-acetyl-alpha-D-galactosamlne:polyp 
tiigti-mobility group (nontiislone ctiromoso 
ESTs 

F!AD21 (S. pombe) homolog 
general transcription factor IIA, 2 (12k 
hypoxanltiine phosphorlbosyllransferase 1 
CD69 aitigen (p60, eady T-cell activali 
ESTs 

protein tyrosine kinase 9 
ESTs 

ESTs, Weakly similar to ALU8_HUMAN ALU S 



Hs.188809 ESTs 
Hs.246315 
Hs.105676 
Hs.1 03267 
Hs.5566 
Hs.251531 
Hs.266176 



lore (activator 1)3 (38 



ESTs 
ESTs 
nucleoporin 50kD 

pyruvate dehydrogenase phosphatase 
Homo sapiens cDNA: FU22009 lis, done H 
proteasome (prosome, macropaln) subunil, 
RNA binding motif protein 3 
cyclin-dependent kinase 7 (homobg of Xe 
•yes-1 YamaguchI sarcoma viral oncogene 

:sTs 

UDP-N-acetyl-alpha-D-galactosamineipolyp 



hypotlietical protein FLJ22548 similar lo 
gap junction protein, beta 2, 26kD (conn 
proteasome (prosome, maaopaln) subunil 



Hs.268555 6'-3f exoribonuclease 2 
Hs.271871 splndlln 



HS.26S313 

Hs.3459 

Hs.144097 

Hs.75355 

Hs.5985 

Hs.18272 

Hs.288757 

Hs.9403 

Hs!l 12242 

Hs.8055 

Hs.99210 

Hs.24968 

Hs.8180 

Hs.9216 

Hs.155145 

Hs.313 



v-ral simian leukemia viral oncogene hom 



tiypotlietical protein FLJ10579 
ESTs 

Human DMA sequence from clone RP1-233G16 
syndecan binding protein (syntenin) 
caspase 7, apoptosls-related cysteine pr 

ESTs 

secreted phosphoprotein 1 (osteopontin, 
Human done 23815 mRNA sequence 
small inducible cytokine subfamily A (Cy 
Homo sapiens cDNA: FU23602 Us, clone L 

ESTs 

ESTs, Moderately similar lo neuronal thr 



QUiate stage 
guiate stage 



Hs.32597 ring linger protein (C3H2C3 type) 6 
Hs.8895 ESTs 

Hs.274501 H.sapiens ACTH receptor mRNA 3WR 



upregulale stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulale stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulale stage 
upregulate stage 
upregulale stage 
upregulale stage 



wo U3/003906 



PCT/US02/21338 



60 
65 
70 
75 















upiegul^e stage 






























403047 












upregulate stage 














upregulate stage 
upregulate stage 


















AK002081 










upregulate stage 


407980 


AA046309 




gb:2f12101.s1 SoaresJetal_heart_NbHH19W 






upregulate stage 




AW461597 










upregulate stage 


408408 
408920 


AF070571 


Hs.44690 








upregulate stage 




AW600895 




gb;UI-HF-BPOi«ir-a-02-0-Ul.r1 NIH_MGC_5 






upregulate stage 




BE147897 




general transcription factor IIF* polype 






upregulate stage 




AA086219 










upregulate stage 




AF279145 










upregulate stage 




BE241870 




gb:TCAAP2E0M7 Pediatric acute myelogeno 






upregulate stage 




AW857199 




gb:RC2-CT0304-080100-01 1-f06 CT0304 Homo 






upregulate stage 




BE562298 




KIAA01 12 protein; homolog of yeast ribos 


2.1 


0^2 


upregulate stage 




AVVB72477 




gb;hm30f03j(1 NCI CGAP Tf^4 Homo sapiens 














KIAA0077 protein 






upregutate stage 




AVraM36 




gb:QV4-DT0021-281299-07&fl11 DT0021 Homo 




o!i7 


upregulate stage 




AW936S86 




gb:PM2-DT0023-08030W)04-g01 DT0023 Homo 


3.4 


0.16 


upregulate stage 




A1681997 










upregulate stage 




AA1 28978 




Homo sapbns cDNA FU13S94 fis, clone PL 










AW462823 


Hs. 135268 








upregulate stage 




BE154078 




gb:PM0-HT0339-200400-010-F04H'n)339Homo 




o!i8 


upregulate stage 








IGF-II mRNA-blnding protein 3 












Hs .50495 








upregulate stage 




AA210765 




gb:zr90c06.r1 NQ CGAP_GCB1 Homo sapiens 






upregulate stage 




AA225632 


Hs. 190016 












AA 225921 










upregulate stage 




BE513731 


Hs.38959 












































Homo sapiens cDNA FLJ14001 fIs, cbne Y7 


















o!l4 


upregulate stage 




A1224532 


Hs .88550 






















upregulate stage 




AI147535 










upregulate stage 




AW972292 


Hs.292998 








upregulate stage 














upregulate stage 


422429 


AA310527 




gb:EST181333 Jurkal T-cells V Horao saple 






upregulate stage 




BE645072 


Hs.122679 


hypotletical protein FU 10461 






upregulate stage 






Hs.123218 






o!i4 


upregulate stage 




NM_001944 




desmoglein 3(pemptiigus vulgaris antigen 




0.09 


upregulate stage 




AW664214 


Hs.196729 








upregulate stage 






Hs!l69B40 








upregulate stage 








growth differentiation factor 10 


2.15 




upregulate stage 




BE397988 


Hs. 179982 


tumor protein p53-binding protein 






upregulate stage 




AA477989 


Hs.98800 








upregulate stage 




Al 276780 










upregulate stage 




AA204686 


Hs. 234149 


tiypoltietical protein FIJ20647 






upregulate stage 




AK001514 




tiypotfietical protein FU 10652 






upregulate stage 




AA356923 




nuclea' cap binding protein subunit 2, 2 






upregutate stage 




AW971146 


Hs. 293187 








upregulate stage 




AW971382 


Hs.294016 


ESTs, We^ly similar to aipiia-1(XVIil) c 




o!i5 


upregulate stage 




AA504626 










upregulate stage 




AL041844 


Hs.277522 


ESTs, WeaUy similar to FYVE finger-oont 






upregulate stage 




AI378562 


Hs. 1595 85 


ESTs' 










AA863400 
















s. 


iiypotiidical protein f-lJ10846 








433108 


AB002446 




gb:Homo sapiens mRNA from ciiromosome 5q2 


2.35 


o!l4 


upregulate stage 


434163 


AF118072 


Hs.283916 


Homo sapiens PR01716 mRNA, complete cds 




D.14 


upregulate stage 


435202 


AI971313 


Hs.170204 


KIAA0551 protein 


1.25 


0.16 


upregulate stage 


436313 


Al 759400 


Hs.189729 


ESTs 


2 


0.18 


upregulMe stage 


436369 


T60843 


Hs.189679 


ESTs 


3.6 




upregulate stage 


435488 


H57954 


Hs.34394 


ESTs 


2.2 


0.22 


upregulate stage 


436683 


AW293909 


Hs.1 56935 


ESTs 




0.19 


upregulate stage 


436862 


AI821940 


Hs.264622 


ESTs, Moderately similar to ALU8_HUMAN A 


3.2 


0.12 


upregulate stage 


437485 


AI149570 


Hs.127363 


ESTs 


2.05 


0.22 


upregulate stage 


437854 


AL119723 




gb:DKFZp761A2124_r1 761 (synonym tiamy2) 




0.15 


upregulate stage 


438316 


AA789249 




gb:ai27g08.s1 SoaresJestis.NHT Homo sap 


2.45 




upregulate stage 


438390 


A1422017 




gb:tf46f12.x1 NCLCGAP_Bm23 Homo sapien 
ESTs 


3.1 


0.13 


upregulate stage 


438915 


AA280174 


Hs.23282 


1.35 


0.12 


upregutate stage 


439983 


AA8S8394 


Hs.117955 


ESTs 




0,13 


upregulate stage 


442048 


AA974603 




gb:op34f055l Soares_NFLT_GBC_81 Homo s 


5,55 


0,09 


upregulate stage 



wo U3/003906 
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442748 AI016713 

443717 BE163884 

446936 AA287637 

446078 A1339982 

446139 H77395 

446183 AA364991 

448253 H25899 

448966 AK001674 

449199 A1990122 

449558 AA001765 

449676 AW014631 

449859 T98077 

450434 AA166950 

450626 AW970107 

451337 A1400209 

451686 AA059246 

452079 AA830908 

452220 BE1S6006 

453918 AVW)05123 

455350 AW901B09 

456511 AA2e2330 Hs.M 



457427 AW971287 
400296 AA305627 
400409 AF1 53341 
400471 
400641 

400751 



401045 
401049 
401192 
401203 
401205 
401276 
401561 



Hs.135787 
Hs.282331 
H3.167586 
Hs.156061 



Homo sapiens mRNA; cDNA DKFZp761P019 (fr 



ls.157079 KIAA1227pr( 



gb:aVO-NN1020-170400-1954i02NN102a Homo 



Hs.170917 
Hs.139336 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale st^e 



Hs,81360 CGI-35 protein 



406476 
406485 

406741 AA058357 

406867 AA1 67867 

407173 T64349 

407230 AA157857 

407266 AJ236664 

407783 AW996872 

407826 NM_006152 

407870 AB032990 

407877 AW016811 

407968 NM_004863 

408162 AA993833 

408363 NM_003389 

408676 NM.003542 



Hs.172028 
Hs.40202 
Hs.40719 
Hs.234478 
Hs.59403 
Hs.1 18527 
Hs,44396 
Hs,46423 
Hs.184109 
Hs.12727 
HS.S4416 
HS.SS05B 



carcinoembiyonlc antlgen-relaled cell ad 

keratin 19 

gb:yc10d08.s1 Sralagene lung (937210) H 

l<eratin 19 

gb:Hofno sapiens raRNA for immunoglobulin 

a disinlegrin and melalloproteinase doma 

l/mphoid-restricted membrane protein 

iiypothetical protein KIAA1164 

Homo sapiens cDI>IA: FLJ22648 lis, clone H 

serine palmitoyltransferase, long chain 

ESTs 

coronin, adin-tjndlng protein, 2A 
H4 histone family, member G 
ribosomal protein L37a 
Iiypothetical protein FU21610 
sine oculis iiomeobox (IDrosophllaj homolo 
.EH-domaln containing 4 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
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409744 AW676268 

410141 RC7775 

410232 AW372451 

410269 AW613697 

410297 AA148710 

410337 M83822 

410418 D31382 

410541 AA065003 

410724 AW799269 

410786 AW803341 

410968 AA199907 

411162 AW819944 

411173 R81571 

411243 AB039886 



412706 R97106 



419932 AA281594 

420026 A1831190 

420187 AK001714 

420193 AI460080 



Hs.66265 
Hs.287657 
Hs,61184 

Hs,159441 
Hs.62354 
Hs.63325 
Hs.64179 



413587 AA156164 

413800 AI129238 

413859 AW992356 

413930 IVI86153 

413991 H44725 

414052 AW578849 

414203 BE262170 

414343 AL038156 

414564 AA5a7775 

414937 AA524394 

414993 AW819403 



Hs.71573 
HS.732B7 
Hs.73291 
Hs.73454 

Hs,74126 
Hs.201262 
Hs,167546 
Hs.76064 

Hs,76355 

Hs,271833 



Hs, 165544 
Hs,77724 
Hs,78353 



416012 AF061959 Hs.78961 

416074 R40174 Hs,2l209 

416182 Nllfl_004354 Hs.79069 

416518 H60730 Hs.18917 

416782 L35035 Hs,79886 

416987 D86957 Hs.80712 

417094 NM.006896 Hs,81182 

417275 X63578 Hs.81849 

417395 BE664245 Hs,82084 

417683 AW566008 Hs,239164 

417759 R13567 Hs,12548 

417848 AA2C6581 Hs,3g457 



418636 AW749866 

418786 A1796317 Hs,203694 

418827 BE327311 Hs,47166 

418948 AI217097 

419551 AW582256 Hs,gi011 



419713 AW968058 
419720 AA249131 
419791 AI579909 



Homo sapiens mRNA; cDNA DKFZp586P2321 (f 
Homo sapiens cDNA: IU21291 lis, clone C 
CGI-79 protein 

gb;hh79g12.i<1 NCI.CGAP.GU1 Homo sapiens 

ESTs 

cell division cycle 4-lil<e 
transmembrane protease, serine 4 
liypotlieiical protein 

gb:RCO-UM0051-210300-012-f01 UM0051 Homo 
gb;IL2-UM007&<)90300-050-D03 UM0079 Homo 



CA11 

gb:ye87a07.r1 Soares fetal liver spleen 

liypothetlcal protein FU10074 

KIAA123S protein 

hypothetical protein FU10881 

troponin T3.sl<eletal, fast 

gb:QV1-DT0069.1 1020(M)67-d06 DT0069 Homo 

fatty acid binding protein 6, Ileal (gas 

ESTs 

ESTs 

tubulin-specifio chaperone c 

gb:seq224S HMSWMYK Homo sapiens cDNA do 

ubiquitin-conjugating enzyme E2N (Inmolo 

gb:MR0-HT0067-201099-002-d10 HT0067 Homo 

Homo sapiens cDNA FU 13473 fis, clone PL 

ESTs 



ESTs 



ns HSPC311mRNA,partidcds 



gb:y(53a04,r1 Soares Infant brain 1NIB H 
gb:HSC1 RF051 normalized infant brain cDN 
gb:HSC38B051 normalized infant brain cDN 
gb:yh19g02.r1 Soares placenta Nb2HPHomo 
protein phosphatase 1, regulatory (Inhib 
ESTs 
cycllnG2 
ESTs 

ribose S-phosphale Isomerase A (ribose 5 

KIAA0202 protein 

histamine N-methyltransferase 

parvalbumin 

inlegrin beta 3 binding protein (bela3^ 

■ isoDNAFU12814lis,cioneNT 



ESTs 
ESTs 



HS.166675 
H5.9S744 
Hs.202869 



crystallin, zeta (quinone reductase) 
EST 

Kruppel-lil(e factor 5 (intestinal) 
cytolieratin 20 

Homo sapiens c»IA FU1 2577 lis, done NT 
gb:aV4-BT0534-281299-053^:05 BT0534 Homo 
Homo sapiens uncharacterlzed gastric pro 

HT021 

gb:qd43lt07.x1Soares_fetal_heartNbHH19W 

anterior gradient 2 (Xenepus laevis) tiom 

poly (ADP-ribose) giycohydrolase 

FXYD domain-containing ion transport reg 

Homo sapiens cDNA: FU22995 fis, ctane K 

nudix (nucleoside diphosphate linked moi 

hypothetical protein FLJ1 1068 

ESTs 

ESTs 

ESTs 

gb;zt03a01,r1 NCi_CGAP_GCB1 Honw sapiens 



upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulalf 



i stage 



upregulalt 

upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 
upregulale stage 



late stage 



upregulale stage 
upregulale stage 
upregulale stage 
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35 
40 
45 



70 
75 





A1623693 


Hs. 191533 












AW295840 




Homo sapiens cDNA: FLJ21513 fis, clone C 














uropaki 






































AA419392 


Hs. 173354 


























M62397 










































AW1 17261 


















gb:zs52c10.r1 NCI CGAP GCB1 Homo sapiens 










NM 004833 


Hs.105115 








upregulate stage 


421634 


AA437414 


Hs;i06283 


hypotheSc™(SVui0262 


7.79 


0.08 


upregulate stage 








neuronal PAS domain protein 2 






upregulate stage 




AK001718 


Hs.108530 


hypothetical protein FIJ10856 
ESTs 






upregulate stage 


421655 


F06504 


Hs!27364 


Z9 


o!l6 


upregulate stage 




AA259011 


Hs.109268 


hypothetical protein FU12552 












Hs,300893 


ESTs, Weakly similar to envelope protein 












Hs.118281 












AW803733 






















































423081 


AF262992 




sperm associated antigen 4 


2]82 


o!3 


upregulaleslage 


423596 


AA328196 




ESTs, Weakly similar to CTL1 protein fi. 


2.75 


0.19 


upregulate stage 
















424005 


AB033041 


Hs!l37507 


KIAA1215 protein 


171 


0:37 


upregulaleslage 


424014 


AA333653 










































































ESTs, Weakly similar to hSIAH2 [H.sapien 




























SWAP-70 protein 










AL049280 


Hs. 155397 


iiomo sapiens mRi^; cDNA DKFZp564K143 (fir 






upregulate stage 




NftA_001241 


Hs. 155478 












W16480 


Hs.9701 3 
Hs.24283 








upregulaleslage 
upregulate stage 




AC002115 


Hs.159309 








upregulate stage 
















426088 


AF038007 


Hs.166196 












NM_00291 3 


Hs. 165563 








upregulate stage 






HS.16B102 










426603 


M382291 




gb:EST95683 Testis 1 Homo sapiens cDNA 5 


l!8 


0^26 


upregulaleslage 




NM_015865 














NM_006379 












426902 


A1125334 


Hs.97408 












NM 003416 




zinc finger protein 7 (KOX 4, clone HF.1 










































chromosome 21 open reading frame 1 1 


















































427737 


AA435988 


Hs!l78066 


ESTs, Weakly similar to AF0682a9 5 HDCME 


5.7 


o!ii 


upregulate stage 


428042 


AA41962g 




gb:zv03d12.r1 Soares_NhHMPii_S1 Homosapi 




0.14 


upregulaleslage 


428336 


AA503115 


Hs.183752 


mlcrosemlnopioleln, liela- 


5'.37 


0.05 


upregulate stage 


428337 
428365 


AA644508 
AA296331 


Hs.183861 


gb:af73c01.r1 Soares_NhHMPu_S1 Homosapi 
Homo sapiens cDNA FU20(H2 fis, clone CO 


3.4 


0.13 


upregulate stage 
upregulate stage 




X57348 


Hs.184510 


stratilin 


1.81 


0.39 


upregulate stage 


428683 


AA430589 


Hs.301374 


ESTs, Moderately similar to ALUS_HUMAN A 


7.55 




upregulate stage 


428670 


AA431682 


Hs.134832 


ESTs 


8.05 


o!i 


upregulaleslage 


428785 


AI015953 


Hs.125265 


ESTs 




0,15 


upregulate stage 


429332 


AF030403 


Hs.199253 


Ste-20 related kinase 


2.64 


0.26 


upregulate stage 


429343 


AK000786 


Hs.199480 


opsins 
ESTs 


3.15 


0.27 


upregulate stage 


429566 


AW139399 


HsS8988 


1.87 


0.31 


upregulate stage 


429663 


M68874 


Hs.211587 


phosptiollpase A2, group IVA (cylosallc, 


0.61 


1.02 


upregulate stage 


429824 


AA296363 


Hs.121520 


Human BAC done GS1-99H8 


2.03 


0,39 


upregulaleslage 


429966 


BE081342 


Hs.226799 


HSPCOag protein 


7.85 


0,08 


upregulaleslage 
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429970 AK000072 

430042 AB023170 

430168 AW968343 

430308 BE540865 

430399 AI916284 Hs.199671 

430763 AA486468 Hs.106668 

431474 AL133990 Hs.190642 

431567 N51357 Hs.260855 



431912 A1660552 

432350 NM_005865 

432620 AI076978 

432524 AI4S8020 

432540 A)821517 

432623 AA557351 

432632 AW973801 

432820 A1654057 

432945 AL043683 

433027 AF191018 

433037 NM.014158 

433156 R59206 

433171 AA579425 



433409 AI278802 

433650 AA603472 

433675 AW977653 

434328 BE564937 

434476 AW858520 

434683 AW298724 

434726 AF062719 

435124 AA725362 

435563 AF210317 

435899 W89093 

436026 A1349764 

436154 AA764950 

436293 A1601188 

436361 AA825814 

436455 A1C27959 

436577 W84774 

436684 AW976319 

437036 A1571514 

437146 AA730977 

437262 BE250537 

437277 AA748016 

437882 AI243203 

438392 AA806395 

438416 N76398 

438739 AA816391 



chloride channel, calcium actlvalad, fam 
KlAA0953proyn 

ESTs, Highly similar to AF1281 13 1 promi 
cyclln-dependent kinase Inhibitor IB (p2 



Homo sapiens mRNA; cDNA DKFZp761G2311 (f 
hypothetical protein F1J10326 
hypothetical protein FU10450 
uncharaclerlzed hyi 



ESTs, Wealdy similar to A56154 Abl subst 



439211 

439394 AA149250 Hs.56105 

439644 W26354 Hs.28891 

439569 AW602166 Hs.222399 

439586 AA922936 Hs.1 10039 

439706 AW872527 Hs.59761 

439897 NM_015310 Hs.6763 

439898 AW505514 Hs.209561 



440035 BE551589 

440619 AW408585 

440635 AW610331 

440787 AW292043 

441233 AA972965 

441528 A10C3797 



141670 I 

441683 BE554214 

441847 AI215564 

442145 A1022650 

442299 AW467791 

442315 AA173992 

442528 AF150317 

442671 0)6338 

442607 AA507576 



Hs.1 88007 ESTs 



ESTs, Moderately similar to PUR6_HUMAN M 

ESTs 

ESTs 

ESTs, Wealdy similar to unnamed protein 
putative nucleotide binding protein, est 
HSPC067 protein 

Homo sapiens cDNA: FU22539 fis, clone H 
gb:nf37c08.s1 NCI.CGAP_Pr2 Homo sapiens 
gb:nv16h12.s1 NCI CGAP_Pr22 Homo sapiens 
double-stranded RNA specific adenosine d 

ESTs 



nscDI^: FU21904 fis, done H 



ite carrier family 2 (facilitated glu 



;.149065 ESTs 



ESTs 

ESTs 

Homo sapiens cDNA: FLJ23068 fis, clone L 
gb;ai51c02.s1 Soares.testis.NHT Homo sap 
Homo sapiens cDNA: FLJ22785 fis, clone K 
ESTs, Wealdy similar to WDNtuI I^T WDNIVll P 
hypothetical protein FU11360 
CEGP1 protein 
ESTs 
ESTs 



Hs.209433 ESTs 



hypothetical protein FLJ21839 
ESTs, Ivloderately similar to ALU5_HUMAN A 
gb:RC4-ST0316-190100-011-c08 ST0316Horao • 



Hs.220972 ESTs 



erbb2-interacting protein ERBIN 



upregulate stage 
upregul ate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 
upregulats stage 



Z15 0.39 upregulate stage 

5.1 0.11 upregulate stage 

7.9 0.1 upregulate stage 

3.54 0.14 upregulate stage 

6.6 0.08 upregulate stage 

245 0.21 upregulate stage 

4.75 0.1 upregulate stage 
1.6 0.18 upregulate stage 
3.88 0.17 upregulate stage 
3 0.15 upregulate stage 
4.6 0.1 upregulate stage 
21 0.19 upregulate stage 

1.76 0.34 upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 



upregulate stage 
upregulate stage 
upregulate stage 
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R40800 


Hs,21303 


ESTs 




0.08 


upregulale stage 


442993 


BE018682 


Hs .44343 


ESTs 




0.34 


upregulale stage 


443015 


R33261 


Hs.6614 


ESTs 




0.09 


upregulale stage 


443085 


AI032660 


Hs.164711 


ESTs 






upregulale stage 


443228 


W24781 


Hs.293798 


ESTs 




0.47 


upregulale stage 


443357 


AW071349 




ESTs 


175 


0.29 


upregulale stage 


443371 


Al 79 2888 


Hs. 145489 


ESTs 


6.86 




upregulale stage 


443564 


AI921685 


Hs. 199713 






o!l8 


upregulale stage 




AW028596 




ESTs 


a25 


0.15 


upregulale stage 






Hs.293776 


ESTs, WeaWy siinila- to 1207289A reverse 


6.45 


0.11 


upregulale stage 


443861 


AW449462 


Hs.134743 


ESTs 


6.72 


0.09 


upregulale stage 








ESTs 


4.25 




upregulale stage 




AB018249 




small inducible cytokine subfamily A (Cy 


8.45 


o!o9 


upregulale stage 


444184 






Human DMA sequence from clone RP1-28H20 


ai 




upregulale stage 




BE278964 


Ms. 11 085 


CGI-111 protein 




olo9 


upregulale stage 


444624 


AV650476 


Hs.282936 


ESTs 


1.SI 




upregulale stage 




AW995395 


Hs.84520 


ESTs 


1.25 


0.21 


upregulale stage 




A1188613 


Hs.143866 


ESTs 


21 


0.21 


upregulale stage 




BE019923 




hypotlietical protein FU 13057 sMIar to 


6.8 


0.1 


upregulale stage 




A1192105 






0.94 


0.6 


upregulale stage 




BE262989 




putative protein 


8.09 


0.1 


upregulale slage 






Hs.208275 


ESTs, Wea)<1y similar to unnamed protein 


4.6 


0.09 


upregulale stage 




A) 200759 




ESTs 


6.85 


0.11 


upregulale slage 




Ai651039 


Hs.143559 


ESTs 


3.15 


0,18 


upregulale slage 




A1206888 




ESTs 


7.81 


0.09 


upregulale stage 




AW189787 






2 


0.07 


upregulale slage 




A)936450 






2.65 


0,12 


upregulale stage 




AA503887 




Homo sapiens cDNA: FU21362 lis, clone C 




0.46 


upregulale slage 




AW058463 




zinc-fingers and homeoboxes 1 


ii 


0.07 


upregulale stage 




BE410347 


Hs!l3063 


transcription fader CA150 


3.8 


0.15 


upregulale stage 


445817 


NM.003642 


Hs.13340 


Hstone acetyllransferase 1 




0.1 


upregulale slage 




AI702901 


Hs. 1465 82 


ESTs 




0.33 


upregulale slage 


446140 


AA356170 




Homo sapiens cDNA: FLJ21908fis, clone H 


216 


0.18 


upregulale stage 




AB021179 


Hs!l5299 


HIvlBA-inducible 


2.65 


0.18 


upregulale stage 




AA393907 


Hs.97179 


ESTs 






upregulale stage 


447086 


AI421397 


Hs.161321 


ESTs 




o!i 


upregulale stage 




AI476732 


Hs.263912 


ESTs 


2!35 


0.18 


upregulale stage 




AI554946 








0.09 


upregulale stage 


447390 


X95384 


Hs!l8426 


translational inliibltor protein p14.5 


7^26 


0.08 


upregulale stage 




NM 004786 




tliioredoxin-lil(e, 32I<D 




0.24 


upregulale stage 


447648 


N53"388 


Hs!7222 


ESTs 


8.6 


0.07 


upregulale stage 




AA373527 


Hs.19385 


CGI-58 protein 


7.3 




upregulale stage 


447863 


A1434204 


Hs.164285 


ESTs, Weal<ly similar to Afg 1 p [S.cerevis 


6.75 


Oil 


upregulale stage 




AA081218 


Hs.58608 


ftomo sapiens cDNA FLJ14206 fls, clone NT 






upregulale stage 




AW292577 
AI459306 


Hs]94445 
Hs.24908 


ESTs 
ESTs 


3X 


013 
0.11 


upregulale slage 
upregulale slage 


448474 

448513 


AI7g2014 


Hs.13809 


ESTs 


272 


0.28 


upregulale slage 


AA344741 


Hs.61773 


Homo sapiens cDNA FU11648 fis, clone HE 


4.8 


0.12 


upregulale slage 










2.65 


0.2 


upregulale stage 




AW970766 


Hsil 78470 


Homo sapiens oDNA: FLJ22662 fis, clone H 


1.68 


o!i9 


upregulale stage 




AW473830 


Hs.171442 


ESTs 


2.95 


upregulale slage 




A1571940 


Hs.7549 


ESTs 


2.3 


0.14 


upregulale slage 




AW408009 


HS.2258Q 


all<ylglycerone pliosptiate syntliase 


8.6 


0.08 


upregulale slage 


449448 


D60730 


Hs.57471 


ESTs 




0.13 


upregulale slage 


449617 


AW600105 
A1655321 


Hs.23543 
Hs.197693 


serine/threonine protein Idnase IWASK 


6.4 


o!l6 


upregulale slage 


449659 


A1655992 
R60031 


Hs.300647 
Hs.198899 


ESTs 

eultaryotic translation Initiation factor 


8.35 


0.09 


upregulale slage 
upregulale slage 


449689 


AF228421 


Hs.301039 


Human Dl^se(iuence Ironi clone IRP1-132F21 


8.35 


0.06 


upregulale slage 




A1674072 




gb:wd15h01.x1 Soares_NFLT_GBC_S1 Homos 


5.8 




upregulale slage 




AW001741 


Hs.273193 


hypoihetical protein FIJI 0706 




o!o9 


upregulale slage 




A1685365 




ESTs 






upregulale slage 


450193 


A1916071 


Hs!224523 


ESTs 


5.73 


o!i 


upregulale slage 


450336 


AA046814 


Hs.288928 


Homo sapiens cDNA: FU23296 fs, clone H 


8.2 




upregulale slage 


450341 


N90956 


Hs.17230 


hypollietlcal protein FU22087 


4.2 




upregulale stage 


450353 


AI244651 


Hs.103296 


ESTs 




0.15 


upregulale slage 


450737 


AW007152 


Hs.203330 


ESTs 


6 " 


0,25 


upregulale slage 


450795 
450928 
451134 


A1744417 
AA318315 


Hs.60435 
Hs.25999 


ESTs 

gh:tr10h12x1 NCLCGAP_Ov23 Homo sapiens 
hypothetical protein FU22195 


1.75 
4.3 


o!i8 


upregulale stage 
upregulale slage 
upregulale slage 


451230 


BE546208 


Hs.26090 


hypothetical protein F1J20272 


4.76 


o!i6 


upregulale stage 


451593 


AF151879 


Hs.26706 




5.8 




upregulale stage 


451618 


AA1 15639 


HS26764 


K1AA0646 protein 


5.8 


o!l3 


upregulale stage 


451668 


Z43948 


Hs.26739 


hypothetical protein F1J10320 


0.73 


0.26 


upregulale stage 


461790 


AA927403 


Hs.43897 


ESTs, Weakly similar to P2CA_HUkJAN PROTE 


3.2 


0,25 


upregulale slage 


452001 


AI827675 


Hs.297735 


Homo sapiens cDNA: FLJ22094 Is, clone H 
ESTs 


3.7 


0.13 


upregulale stage 


452039 


AI92298B 


Hs.172510 




0.65 


upregulale slage 
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452046 


AB018345 


Hs.27657 


KIAA0802 protein 


1.13 


0.39 


upiegulate stage 


452092 


BE245374 


Hs.27842 


hypothetical protein FLJ11210 


3.2 


0.15 


upregulale stage 


462278 


AL037716 


Hs.28785 


Homo sapiens mRNA; cDNA DKFZp586F0219 (f 


8.25 


a07 


upregulate stage 


452381 


H23329 


Hs.290880 


ESTs, Weal^ly similar to ALULHUMAN ALU S 




0.34 


upregulale stage 


452420 


BE564871 


Hs.29463 


centrin, EF-hand piotan, 3 (CDC31 yeasi 




0.13 


upiEgulate stage 


452714 


AW770994 


Hs.30340 


hypothetical protein KIAA1165 


7.6 


0.09 


upregulale stage 


453078 


AF053651 


Hs.31534 


melaxin 2 


5.3 


0.09 


upregulate stage 


453370 


A1470523 




ESTs, IMerately similar to translation 




0.13 


upregulale stage 




BE279901 




hypothetical protslnFU 10775 


3.95 


0.11 


upregulate stage 


453972 


AW1 37224 


Hs.245869 


ESTs 




0.09 


upregulale stage 




AW022393 




gb:df37h12.y1 Morion Fetal Cochlea Homo 


1.15 






464289 


AL1 37554 


Hs.49927 


Homo sapiens mRNA- cDNA DKFZp434H172C [f 


7.05 


0.1 


upregulate stage 


454314 


AW364844 




gb;QV3-DTC044-221299fl45-c03 DT0044 Homo 




0.37 


upregulate stage 


454315 


AW373564 




nuclear pore connplex interacting protein 






upregulate stage 


454775 


BE160229 




gb;QV1-HT0413-090200^62-a12 HT0413 Homo 


8.5 


0.09 


upregulate stage 


454790 


AW820852 




gb:RC2-ST0301-120200^11-f12 ST0301 Homo 


1.15 




upregulate stage 












012 




455170 


AW860972 




gb:QV0-CT0387-18030W67-h07 CT0387 Homo 


6.7 


0.07 


upregulate stage 


455511 


BE144762 




gb:CM0-HT018O^)41099-065-b04HT018O Homo 




0.26 


upregulate stage 




AI751357 


Hs.288741 


Homo sapiens cOm. FU22266 fis, clone H 


8.36 


0.09 


upregulate stage 




AW976410 


Hs.269069 


Homo sapiens cDNA: FU21016 fIs, done C 


4.86 




upregulate stage 


456279 


AW006783 




ESTs 


7.25 






457518 


AA825350 


Hs!l 43805 


ESTs, WeaUy similar to ALUOtJMAN ALU S 




0.11 


upregulate stage 


457570 


AA579426 


Hs.1 90226 


ESTs 






upregulate stage 


457982 


AW856093 


HS.1B3617 


ESTs 




6.25 


upregulale stage 


458080 


BE142728 




gb;MRO-HT0157-021299-004J08HT0157 Homo 


Z05 


0.27 


upregulate stage 


458340 


A1457102 


Hs.121583 


2.25 


0.18 


upregulate stage 


458440 


AI095468 


Hs.135254 


ESTs, Weakly similar to thromliaspondin t 
ESTs 


2.35 


0.13 


upregulate stage 


458771 


AW295151 


Hs.163612 




0.19 


upregulate stage 




AA722012 


Hs255757 


ESTs, Weakly similar to KIAA0611 protein 


6.95 


0.1 


upregulate stage 



TABLE 1B 



Pkey: Unique Eos probeset Identifier number 
CAT number: Gene cluster number 
Accession: Genbarl< accession numbers 



AA084958 AA214034 AA044587 AA130152AA130116 D81924 
W05608AW118362AW1 96215 
AA046309 AI263600 AA046397 

AW175997 AW176000 AW175999 AW175994 AW176004 AW176989 
AWl7654eAW178965BE141057 
AL138247AA053S29BE173313 
AW177969AW177960AW178066AW178022 

BE141991 AW17B855 BE141990 BE141993 BE141994 BE062405 BE062404 AW178850 
AAD69998 AA076669 AA075660 AA069828 

AW387837 AW387874 AW387826 AW3B7758 AW387864 AW809268 AW387754 

AA078899 AA078782 AA075788 

AA302840 T93016 T92960 AA077661 

AW885691 BE185989BE185940AW500322H10282 

AW500896AW937549 

AW502122 AW50212S AW601663 AW501720 
AW582668AW818666AW818647AW818655AW818637AW818234 
AW904699 AA325920 AW964919 AA081786 T05150 
AW613S97 BBB80236 BE080168 

AW749927 AW749938 AW74g986AW749966 R87124AW860547 

N95428 W24040 AW751366 HB19B7 

AW752710 BE180336BE180186 

AW905138AW753008R13818Z43519 

AW753108 AW852909 M36993 AW894327 AW903629 

N29220AW754225AW754224 

AW858528AW754362H74000 

AW799269 AW799386 AW799501 AW799386 AW799270 AW799467 

AA357918AA089516AA357917 
T53840AW801569AW801568 

AW803341 AW803266 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AWB03334 AW803355 

AW807073 AW807055 AW807067 AW807276 AW807030 AW807363 AW845892 AW807091 AWB07275 AW807284 AW807287 AW845891 

AW8C7195 AW807271 

AW810204 AW810666 AW810196 AW810619 AW810607 
AW812278 AW812286 AW812274 
AW812877 AWB12878 AW812952 AWB12880 
H93000AW818022AWB18044 
AW816156AWB65743AW86S702AW86569B 

AW819191AW819252AWei91B3AWB19175AW819177AW8l91B6AWB1 9180 BE158470AW819242AWai9269AW81 9244 AW819190 
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411170 
411173 
411193 
411242 



412134 
412178 
412296 
412303 
412305 
412312 
412327 



412389 
412547 

412569 
412671 



413618 
413707 
413743 
413782 
413792 
413B04 
414091 
414203 
414220 
414276 



1233775J 
1234099J 
1234102J 
1234161J 
1234379 1 
123446.1 
1235254.1 
1236393.1 



AW819266 AW819268 AWeig246 BE162602 AW819249 AW8ig251 AW819263 AW819194 

AW819455AW819711AW819553 

AW819867 AW819956 AW820159 

AW820178 AW820183 AW819870 AWB19937 AW835709 AW820184 AW820157 AW819945 AW820096 
AW319944 AW820182 AW82016B AW819959 AW819953 
AW820503AW820306AWe20429BE174741 AW820244BE174738 
R81571AW9B2014AA324277 

AW821484 AW821461 AW821490 AW821525 AW821526 AW821519 

BE146808 BE1 46949 AW833828 BE1 47002 BE147066 AW833422 AW833475 AW833485 AW83341 4 AW833406 AW833395 AW833640 
AW833836 AW833421 BE1 46805 AW833465 BE146753 BE147004 AW833473 AW833673 BE147068 AW833466 AW833680 BE147063 BE147058 
BE146807 AWB33830 AWB33694 AW833596 AW833601 



AW837676 AW837356 AW837581 AW837348 AW837681 



411245 1236412 1 AW833441 AW833652 AW833700AW833610AW833673AW833675 

411282 1237660.1 AW995011 AW880630 AW996662 AW88C196 AW860455 AW995379 AW880634 AW938253 AW880580 AW938263 AW880684 AW860464 
AW880603 AW880026 AW938265 AW880592 AW852203 AW880588 AW880680 AW9ffi392 AW88059B AW860503 AWS80590 AW880175 
AW846S34 AW88Q0g8 AW88010B AW860453 AW93B051 AW99Sigi AW88QS32 AW880542 AW93B197 AW994929 AW880635 AW835438 
AW938057AW938aS4 

411327 
411336 
411339 

411407 1244071.1 R00903 R20424 AW844169 

411418 124518 1 BE241870AI752S84Z19223AA307961 AA1B1165AA0g3272AW368570 

411425 1245503.1 AW846012AW846007 AW845995 AW845975 

411461 1246730 1 AW847937 AW847672 AW847809AW847923 AW847808AW847616 

41 1 474 1 247047 J AW848427 AW848890 AW848159 AW8481 1 8 AW84B634 AW848285 AW848086 AW848485 AW848283 AW84B1 62 
AW850327 AWB50350 AW85034B AW850375 

AW851186AW996967BE143456 
BE144593 AW85 1 632 AW968149 AW9681 44 W21059 

AW857199 BEQ6601B BE066982 BE066082 BE065949 BE066Q19 BE066081 BE065954 BE066084 
AW858443 AW868436 AW861923 AW858561 AW861B72 AWB68474 AW861904 AW858398 
AW859650BE071806AW859655 
AW993247 AW861464 
AW86356B BE151696 BE161629BE161B24 
AW897793 R20054H23321 

AW947946 AW947986 AW947977 AW8651 87 AW865246 AWB651 82 AW865376 AW865392 AW865184 

AWB66330 AW866410 AW866332 AW86641 1 AW886327 AWB66326 AW866335 AW866334 AW8B6331 AWB66333 AW866328 AW866329 
T89420N87S10AA095932 
AW872477BE0BB101T05990 

AWB76648 AW876577 AW876528 AW876623 AW87651 9 AWB76540 AW876569 AW876534 AW876563 AW875545 AW876574 



411560 1249443.1 

411568 1249776_1 

411591 1254166.1 

411727 1266Q52J 

411741 1255976 J 

411771 1257335.1 

411787 12S8789J 

411788 125879.1 
411826 
411840 
411860 
411880 



1263110.1 
1266125.1 
1279147.1 
1281471.1 
1288043.1 
1288130.1 



12a8935u1 
1290342.1 
1290611.1 
129085.1 
1291505.1 



1305813.1 
1306011.1 

1319959.1 
1324622.1 
1331178 1 



AW898526AW898525Z19700 

AW936233AW936272 

AW936336AW936339 

AW936369 AW936377 AW936355 AW936410 AW936460 

AW936686 AW936656 AW93660B AW936635 AW936662 AW936679 AW936726 AW936602 AW936733 



A\AI93gi48 AW93g200 BE161819 
AW939537AW939684 W92705 
AW837985AW837938AA101955AW837913AW837935 

AW945964 AW946020 AW946034 AW946027 AW946041 AW946044 AW946033 AW946024 AW946021 AW946029 AW946015 AW946016 

AW94603g AW946045 AWg4602B AWg46036 

AW947655AW984020 

W27161 AW961828 

T31474 T31 123 Z46146 AWg62040 

AW977734 H52048 

N31101 N46491 AW994084 

D61870D61866BE002674 



413141 1350477 1 

413266 1356260.1 

413294 1359149.1 

413342 136279.1 

413402 1366932.1 



1389912.1 
1390710.1 
141621.1 
1425510.2 
1426940.1 
1432115.-1 



BE300352 BE299274 BE075351 BE297444 
BE144034 N72932BEOB0176 
AA128536 AW973587 AA563822 

T24065BE092527BE092528BE092204 BE092271 BE092516 BE092202 BE0922B0 BE092065 BE0919gi BE092284BE092514 

BEG91995 BE092063 BE09206B BEQgi994 BEQ92283 BE392447 

BE141022BE141513BE141532 BE141488 BEMMSI BE141526BE141516BE141471 BE141478 BE141476 BE141475BE141021 

BE141533 BE141464 BE141490 BE141472 BE141480 

N52628BE143639 BE143706 

BE145354BE145429 BE145418 

BE154078BE1 64006 BE163891 BE153523 BE153983 

BE158679BE158714 

BE161004 BE162497 BE151135 BE160999BE162492 
BE546104 BE165710 BE382801 

BE166924 BE166g21 BE166925 BE166915 BE166970 BE166968 

T64682BE168190BE168256 

T83742AW672BB2AA13537B 

BE262170 BE382553 BE261026 BE273627 

BE298094BE267860 
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1511216J 
1515978_1 
1522649_1 
1523036_1 
1527354_1 
1533528 1 



1539911J 
1639951.1 
1540373J 
1540758.1 
1540853.1 
155215.1 
1554447.1 
155791.1 
156264.1 
1574973.1 
1589633 J 
1592777 1 
1604694.1 
166556_1 
1674485.1 
1696282 1 
171237.1 
1736343.1 
175699.1 
1762961.1 
176677.1 
176907.1 
177402.1 
1784125.1 
180808.1 
1816038 1 
182217.1 
185148.1 
185172.1 
187390.1 
188248.1 
188252.1 
188386.1 
189159.1 
1B9768.1 
190755.1 
192170.1 
195293.1 
197723 1 
200885_1 
201378.1 
201903 1 
202288.1 
205732.1 
207654.1 
213258 1 
215498_1 
216469.1 
2171 60_1 
218192.1 
221034.1 
221906.1 
224988.1 
225175.1 
227560_1 
230430.1 
230585 1 
232362.1 
232478 1 



BE549143 BE390613 BE277344 
T07114BE5436B8 
C17372D63180 D789Q8D63252 
D31010D30991 D31168D31166D31465 
D60075 D6Q259 D51037 
D60468 D6Q950 D61218N98936 
D80503 D81739 N95691 
F05086F05Q91 R17158 
R1 1813 Ml 64376 T56632 
F06565 Z43486 R1 8417 F06477 FQ6476 F07098 
F06771 H04B95 Z4277B 
R15239Z45189FQ6836 
Z44067F07617 R34555 
Z43930F12170H10BB1T65261 
F12294F12213T66272T66261 
F12664 T74312 N80318Z20039 
Z43847F13068 T75331 
F13168R21289 T77628 
AA167712 AW936024 AW364438 AW364446 
R21651 H00542 
D59243 D63202AA169716 
AA262226AI984175AA171807 
R52782R17313H24192R19876 
H49213 W90725 H49398W90726 
R94575 T99BB6 H52989 
H69044T47667H75691 T50292 
H15635H16691 AA1955Q6 
T98987T96744 N783Q6 
R64719Z44680R12461 
AA2107B5T95700 H94407 
R91254 T97156 R58711 



423632 
423644 
423818 
423837 



423912 233241.1 



T76248T88763 R93361 
AA224827 T59708 T59843 BE156903 
AA225188 AL157508 AL157509AA226189 
AW749855 AA225995 AW75Q208 AW750206 
Z42183 T31621 T97478 

AI21 7097 AW886090 W38035 W38792 AA23283S AW936043 
T86216T86117T83316 

N99638AW973750AA328271 H9C994AA558020 AA234435 N59599 R94815 

AW978038AA804204AA243400 

AA826279 AA243426 AW971614 AW967805 AW971605 

AA953770AA249091 

AW966946 AA250984 H91733 AW966941 

R77402 AA262462 AA250988 R06794 

AA2S1139 AW967485 AA2S1204 

AA281594AI365043AA251996 



AA257990 AW81 6460 A141 6981 AW500873 
W87731 AA279004AA279001 
AA281809 H89487 N46537 
AA81 1804 AA809404 AA286907 AW977624 
AA2B7443 AA419385 BE084078AI478347 
AA361752 AW963276 AA288017 
AA806639 AA291 008 AA836274 AW978806 

AW754341 AW858420 AW858475 AW861969 AW861925 AW858554 AW85861 6 AWB58450 AW858545 

BE048255AA313083AA298419 
AA339015AW962948AA306188 
AA30g272AA309312AW961837 
AA310527 AW962295 Z44865 H06641 
AA311407AW958321 N23583R70050 
AA312730 AW963285 W28250 W27318 
BE40g561 BE1 62766 AW732798 
AA318334AW961457AA317752 
AA323114AA321992BE161381 BE161392 
AW864848 AA32221 3 AA322209 AW961 624 
AA364195AA32502gAW962050 
AA32B824 AW961410 AW958096 
AA329048AW962484 T83628 
AA332439 AA332579 AA33147Q AWg62539 

AW937063AW937056AW937062AW937059AW937057AA331599AW937061AW937093AW937094AW937058AW851327BE 

AW752886 AW937Q55 AW937054 

BE091233 BE080904 BE079414 BE075356 AA332482 

AA339646 AA339189 AW966286 

AW854298 AA343691 HS0917H5G907 H50938 

AW964082AA348838AA348B39 
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424993 245782_1 F07625 R36115AW953115F06102H13361 AA349497F11152M707254 

425360 25Q631_1 BE547704AA355909 

425362 250655J AA355936 AI741379 

425672 254716 1 M361483A1557239 

426356 26538L1 BE535836 M376153 

426476 267736_1 AL1 34728 M379975 M379440AW963931 

426561 2691 58 J AA381437 AA628833AW407275 

426603 269825_1 AA382291 AA994657 

427259 276594 1 AA400096 AA400167 

427566 2804Q1_1 AI743515 AA405617 AW276706 

427980 285225_1 AA418305 A1264351 

428042 286292_1 AA419529H970B9H96977 

428192 288Q21_1 AA424051 AW104616 

428337 289967.1 AA644508AA479489 AA426174 

428436 291472.1 BE080180 AWB27313 AW231970 AA995028 AA428584 AWB72716 AW892508 AW854S93 AA57B441 AW975234 AA664937 AA984131 

AA52B743 AA552874 AA564758 AW06324S AI267S34 AW070190 AW893483 AA770330 AA906928 AA906682 AA758746 AA651717 AW06331 1 
AM29538 

429236 301580.1 AA448407 AA44S404 AA448424 AA448426 

429917 31049.1 H80572 AL1 09691 H80520 

429979 311517.1 AA463338 M86137 M85945 A1040226 

430076 31295?.1 AA465115AW967750 AW869525 

430350 316401.1 BE169639AA476976 

430561 319932.1 BE065227 BE06S366 BE065182BE065184AA48123g 

430640 321094 1 AA482636R1 6336 R1 6343 R60706RS3647 

430795 323756 1 AW971398AA5D7162AA4B6494 

431319 331474.1 AA873350 T82429 T82428 

431322 331543.1 AW970622 AA503009 AA502998 AA502989 AA502805 T92188 

431670 336353 1 AW971287AA524976AA513479 

431828 338201.1 AA572994 AA616249 AA702595 

432093 341283_1 H28383 AW972670 H2B359 AA525808 

432222 343347.1 AI204995 AW827539AW969908AW440776AAS28756 

432724 35330.1 X98266 N41124 

433005 357346_1 AW939074 AW939073 BE160476 AW939938 AW939206AW940012 AW939076 AA573577 AW750479AA674383AW970067 

433108 35895.1 AB002446 T03146 

433129 359479.1 AA577814 N68643 N58478 H79613A1910764 

433171 360292.1 AA579425 AW969965 AA579102 

43331 1 362908 1 AA6881 49 AW975336 R82513 R79237 

433371 364430.1 T25451 AA585296 AASB5305 

433440 36629 -1 AF052127 

433586 370470 1 T85301 AW517087 AA601054BE073959 

433782 37414 T AF090945 AW996764 A1064870 

434006 37855.1 AF113688 A1114617 

434092 379978.1 AA625155 BE502124 

434138 380572 1 AA625804AW418787AW074833 AI676642AI393368 

434280 382816^1 BE005398AA628622 AA994155 

435382 405265.1 N54493AA679039 N76605 

436023 413562.1 T81819AW393709AA703541 AW370185 

436359 41847.1 Z83806AJ132091 AJ132090 

436383 41888.1 BE065178AJ227879 

436740 426095 1 AW976133AA729943 AA806813 

436384 428731 1 BE046657 AA73661C AW827495 

437087 432797 1 AA745563AA746334 AA744168AA744044AA744034AA744066AA745569 AA744397 

437146 43371 1 AA730977A)261584 AA334473 Z43283AW876861 AW938044 BE150701 AW936262 AA306862BE565575 BE567380AA728920 

AA167512 AI239729 A1251752 AA485791 BE668425 AW962958 

437214 434730 1 BE092336 BE092259 BE092497BE092051 AA746882A1336378 

437240 436139.1 AA747537 BE089C68 BE089070 

437251 435242.1 AW976082 AW976085 AA747803 AA747680 

437274 435465.1 AA747965 AI248281 AI752458 

437324 43609.1 AL1 62077 AW023677 

437471 43747.1 AL390169AW277196 AW131001 

437798 443128.1 AW811767AA831890 AW977539AA768655 

437854 44418_1 AL119723AL119874AI909018U50537 

437938 44573.2 AI95CC87 N70208 R97040 N36809 AI308119AW967677 N35320 A)251473 H59397 AW971573 R97278 W01059 AW987671 AA90B598 

AA251875 A1820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328617 AI2197e8 AA884444 N92578 
F1 3493 AA927794 AIS60251 AW87406B AL134043 AW236363 AA663346 AW0082B2 AA48B964 AA283144 AlBg0387 AI960344 AI741346 
AI689C62 AA282915 AW102898A1872193A1763273AW1 73686 AW150329 A1653832 A1762688 AA988777 AA488892 A1356394 AW103813 
AI639642 AA642789 AA856975 AW505512A1961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW85653e AAieOD09 
AA337499 AW951 101 AA251669 AA251874 AI819225 AW205862 AI683338 A1B68509 AW276905 A1533006 AA972584AA90B741 AW072629 
AW51 3996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA78041 9 AA551005 W80701 AW61 3456 AI373032 
AI664269 F00531 H83488 W37181 W78802 R66066 AI002839 R67840 AA300207 AW959581 T63226 F04C05 

438316 454745_1 AA789249AA904217 AA904142 

438366 456119 1 AA805760 BE000856 

438390 45652.T A1422017 A1422945 A1363249 A14231 13 A1925592 A1420796 A1208187 AI423279 A1423645 AI42409C AI359637 AL044732 D1 7003 

438739 464191.1 AA815391 AW5731B5 

438990 46760 1 AF085B90 H29949 H29856 

439183 46956"l AW970600AA503323H89218AF086031 H89112 

439330 47119J AF086147 W20068 N91464 
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440012 
440635 
442048 
442618 
444163 
444282 
444684 
444646 
445396 
445832 
446119 
446312 
446922 
447258 
447376 
447622 
448632 
448643 
448778 
448871 
449213 



449901 
460257 
460673 
460625 
450870 
460928 
461283 
451369 
461366 
461373 

461496 
461535 
451724 
461644 
452163 
452453 
462467 
452519 



473671_1 
47673J 
483290_1 



69866.1 
71439_1 
719483.-1 
724922.1 



830121 1 
83907_1 
84032_1 
84994.1 
851593J 
66479_1 
8571 1_1 
867325_-1 
867533_1 
87131_1 
B7182J 

882130.1 
888230.1 
902067.1 

918409ll 
920292.1 
920646 1 
921149.1 
92188.1 
922216.1 



462738 
462826 
453472 
463677 
463726 
453846 
454044 



464471 
464624 
464554 



464592 
454622 
454637 
464648 
464687 



977149 1 
978760.1 
983043.1 



1152648_-1 
1213795.1 
1221640_1 
1223842.1 



HI 0438 AA837479 H16299 AW978825 

AA861072H02819 R25946 

AW610331 AW610338AA897379 

AA974603 A1984319 AW340495 

AF150226A124Q137 

AI126098AI184746A1148621 

A1138955BE149059BE149027 

AI168422D80113T59074 

AI184665AL037304AW793549 

BE181792 A1222266BE181789 BE181837 

AI261545 N59134 AW875371 AW875247 

D29527AI275850 



BE143888AI382949 T06429 



BE410360AA442408AA315540 
AI640355 BE464385AW197B79 
AA002047 N72537 H64142 H81580 
AI674072 BE268487 

AW820313 BEQ65936 BE065948 BE 155653 A16B9473AW862275AW820265AW820438AW820331 
AW964334AW967230 T08913 R11895F11463 AA010271 
AW970107AA513951 AA010406 

AI744417R91614H77366 
H83979R84433AA017024 
H85334AA017286 H86654 
Al 791783 

AI792Q30 AW386 1 1 6 AW876 1 46 
AA018072 N46370 R84847 

H86887 H86898 H86906 AA01 961 9 AA0181 27 AA056306 AA053726 

AW970577AA861861 AA418368AA018329AA5018e5 

AI903765 A181 1 1 94 BE007U7 AW1 30760 

T61430AI82Q546A1821336 

AI863140W80703 R43474 

A1902519AI902518A1902516 

AW062499 AWQ62496 AW062600 AW062492 AW062494 AW062493 AW062496 AW062498 Ai902660 A1902649 



1226050.1 
1226951.1 
1227707 1 
1227976.1 
1229077.1 

1229118.1 



AI90551B A)905516 AI905457 A)905515 AW176013 AW176037 



AA02S735 BE264405 AL040323 AA323099 

BE077084 AW1 39963 AW8631 27 AW806209 AW806204 AWB06205 AW806206 AWB0621 1 AW80621 2 AW806207 AW606208 AW80621 0 
AI907497 

AL1 33800 AI916209 

AI921S23AW903707AW903687 

AL037925AL037931AL037967 

AL079389AL079390 

W28543 All 19531 

AL167686AL167590 

AW022393AW022578 

AW022885AWO23064 

AW062974AW859625 BE081366 

AW364844 AW364847 AW937534 AW937593 AW937659 

BE064097 BE064124 BE064087 AW371617 

AW372937 AW378043 AW377970 BE145B50 AW377858 AW377g64 AW378040 

AW85a722 

AW902126 AW792853 AW837703 AW83771C 
AW857191 AW857192 AW804060 

AW847505 AW811792 BE0ei442 BE061433 AW847506 AW806999AW806996BE061436 BE061430 BE142460 BE1 46499 AW806994 

AW809166 AW806991 AW814082 AW806992 BE061669 AW807002 BE146659 AW806996 AW807000 AW845743 AW845747 AW847504 

BE142458 BE061431 BE061435 AW847507 BE146650 BE142470 AW814096 AW807012 BE061438 AW807011 AW806gg3 BE142465 BE142459 

BE142462AW854330AW854331 BE061434BE061731 BE142464AW847501 AW807001 BE142463 AW811800 BE061437 AW811802 BE061440 

AW806g97AW806998 BE061745 BE061753 

AW810112 AW8101 14 AW810032 AW809631 AW810183 AW81017B 

U70071 AW883715AW81O7O1 

AW811613AW811611AW811610 

AW811960AW811957AW811959 

AW814473 AW816157 AW81 6064 AW81 3298 AW813315 AW816100 AWB13389 AW813338 AW813342 AW816108 AW81331B AW816105 

AW816160 AW816163 AW814480 AW813392 AW816063 AW813399 

AW813350AW81B082AW813476AW813383 
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454702 
454729 
454750 
454775 

454789 
464790 
454797 
454851 
464863 
464893 
454898 
464961 
454956 
46604C 



466207 
465225 
455235 



455511 
455573 
455586 



455761 
455831 
455874 
455880 



456212 
456235 
456493 



457340 

467427 
457437 
457474 
467626 
467892 
458080 
468172 



1234106.1 
1234742_1 
1234762.1 
1235093_1 
1237414 1 
1237794_1 
1239472_1 
1239833_1 
1 24661 2J 
1246711J 
1250028_1 
1250636_1 
1252361J 
1254554_1 
1256906_1 
1259748J 
1260400_1 
1262318.1 
1265634.1 



BE145915 BE145893 AW892892 BE145806 BE145901 AW814144 
AW817003 AW816997 AW817126 AW817331 AW817120 BE15B378 AW817121 
AW866286 AW819000 AW819153 AWB19014 AW866641 

BE160229 AW819B79 AW620179 AW819882 AW819876 AW820169 BE1S32C1 AW993736 BE1S2911 
BE156314BE156316AW820750 
AW820852 AW820773 AW821088 
BE161168 BE162466AW821260 

AW836127 BE071453AW991660BE071473BE071461 BE071469 BE071472 

AW836610AW836806 

AW837753 AW837754 AW837700 

AW838126AW838206 

AW847464 AW847462 BE063767 BE063756 

AW847725 AW847599 AW8476Q0 AW847706 AW847854 AW847595 AW847850 AW847795 AW847798 

AW862286 AW861934 AW862096 AW852274 
AW862530 AW862527 AW852526 

AW854829 AW854805 AW854841 AW854825 AW854822 AW854830 AW854835 AW854826 
AW861666 AW867776 AW861555 AW861567 AW867799 AW867768 

AW860972 AW862698 AWe62599 AW860988 AW860983 AW860898 AW860925 AW860922 AW860986 AW860984 AW860989 
AW947884 AW947918 AW947888 AW947883 AW947897 AW947910 AW947905 AW864751 AW947878 
AW994394 AW866900 AW865905 AW866891 AW866014 AW866898 



1280616.1 



1379514.1 
1330022.1 
1381603.1 
1384537.1 
1335884_1 
1385886.1 
1389984.1 
1396805.1 



1655565.1 
168688.1 
193331. 
202684.1 
233842.1 
322100.' 
336353. 
337879. 
341077. 
373012.1 
432926.1 



458841 734186.1 



AW875951 AW875950 AW875935 AW875948 AW875939 AW875957 
AW880861 AW880883 AW881117AW881012 

AWa97292 AW89725 1 AW897298 AW897248 AW897259 AW897250 AW897246 AW897242 
AW901809 AW901787 AW901795 AW901792 AW901744 AW901753 AW901807 AW901798 
12B4023.1 AW901942 AW901937 AW902016 AW901947 
1287679.1 BE16018B AW935785 BE160401 BE160319 BE160313 BE160395 
1288605J AW936969 AW936920 AWg36975 AW936906 

1289639.1 AW93B204AW938195AW938205AW938206AW938193AW938199AW938200AW938190AW938201, 

AW938192 AW938207 AW938194 
1321229.1 BE144762AW979091 
1332529.1 BE0049B8 BE004574 

1 334857 1 BE070794 BE070791 BE070792 BE070795 BE0707B9 BE070788 BE070786 BE070787 BE070790 
1 335422.1 BE008343 BE008321 BE008326 BE008314 BE00834S BE00833C BE00B344 BE0C8327 
1337549 1 BE011703BE011692BE011697BE011624BE011673BE011644 
1348563.1 8E064415BE064430BE064448 
1348720.1 8E064655BE153963 
1 349078.1 BE065209 BE065364 BE0651 10 BE0651 1 1 
1349554.1 BE065803BE065826 
1349716 1 BE066007 BE066017 BE066074 
1359566 J BE080895 BE080906 
1373969 1 BE144966 BE144957 BE144958 

BE152283 BE152260 BE152297 BE152292 BE152269 BE16226C 
BE153203 BE153146 BE152981 
BE155185 BE1 56264 BE1 55186 
BE159432 BE159313 BE159295 BE159242 
BE161004 BE162497 BE161135 BE160999 BE162492 
BE161001 BE1624g4 BE162470BE161172 
BE167014BE167068BE167062 
BE177000 BE177003 BE176816 BE176952 

AW450979AA136653AA136656AW419381 AA984358AA492073BE168945AA8C9054AW23B038BE011212I 



R11B13AA164376 T56632 
N51636T61874 T51829 
AA203637AA832266 H67452 
AA261B30 AW967855 H26953 AA262478 
R91600T87C79AA2914S5 
AL1 18754 AA333202 H38001 
AA492071 AA484229 AW969850 
AW971287AA524976AA513479 
AW969732 AW970974 AA515741 
AW972935AA525272 N28227 
T10073H14872AA604786 

AA744389 AA74427C AA744284 AA744299 AA745380 AA744337 AA846905 AA847698 

BE142723AA834047AWg37124 

BE007237 BE007496 BE007263 W19919 BE646311 AA9848ig 

AI444821 W26012 



AA046742 AA66126g AA001606 

AI940577 AI940580 AI940568 Aig40578 AI940569 AI7958S8 AI79S867 AI940572 
AI902169 A1902173 AI902323 AI9Q2174 AI902347 
BE17B517AI908132BE142437 



TABLE 1C 



Pkey: Unique number conesponding to an Eos probeset 
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Sequence source. The 7 digit numbers In llils column are Genbanl< Identifier (Gi) numbers, "Duntiam 1. at al," refers to »» publlcalion entilled "The 
DNA sequence of iiuman ciiromosorae 22.' Dunham I. et a)., Nalune (1999) 402:489495. 
Indicates DNA strand from which exons were pred'cted. 



4D0773 
400843 
400844 
400845 



400888 
400937 
400977 
401002 
401024 



401256 
401262 
401271 
401276 
401279 
401342 
401365 
401395 
401420 
401439 
401451 
401508 
401519 
401537 
401561 



401759 
401780 
401866 
401905 
401994 
402001 
402075 
402076 



7331445 
8131609 
8131616 



8072510 
8117251 
8117489 
8117619 
7232177 
7232177 
9957912 
8516137 
8568122 



9800062 lUlinus 



8117407 
8117410 
7249154 
8131678 



NLposltion 

106629-105760 
619C2-62137 

472381-472528,474170-474277,475328-475542,475878-476000 

101636-101881 
96756-97558 
4786-4992 
27682-27840 

17982-18115,20297-20456 
9162-9293 

198991-199168,199316-199548 



114220-115164 
7236-7605 

4411&44238,48208-48321 

5863-5970,7663-7784,8892-9023,9673-9807,10634-10789,15254-15403,23827-23958 
24746-24872,25035-25204 

39310-39474 

151830-152104,152649-152744 
29235-29336,36363-36580 

119239-121642 
199600-199875 



77898-78050 

60561-60802 

90044-90184,9111 

132430-132761 

149167-160692 



77081-77226 
69569-70101 
176341-176452 

172961-173056,173868-173928 
167373-167433,167936-168031 



35662-35799 

61292-61911 

15919-16096 

13536-13669 

3096-3242 

119572-119672 

11602-11771,46357-46489,68720.58916 

141362-141602 

92993-94026 

119926-121272 

110779-110983 



186786-1 87029,1 90607-190779,198218-198348 

10662-10838,19815-20018 

96277-96420,96979-97160 

119836-120185 

141309-143676 

25414-26310 

162333-162715 

64056-64168 

118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866- 
131932,132461-132575,133580-134011 



28397-2861 7,28920-29045,29135-29296,2941 1-29567,29705-29787,30224-30573 

73126-73623 

153965-154441,155599-156819 

42904-431 24,4321 1-43336,44607-44763,45199-45281,46337-46732 
68052-68223 

121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 

128316-128627 

101610-101819 

173889-174062 
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60658-60738,61677-61803 

115812-116187 

110326-110491 

62382-62552 

114593-115588 

53526-53628,55755-55920,5753(M7757 



402522 
402630 
402646 
402553 
402604 
402716 
402727 
402812 



402901 
402922 
402938 
402996 
403005 

403029 
403047 
403073 
403086 
403092 
403106 
403172 
403212 
403214 
403277 
403331 
403344 
403362 



9211324 Plus 



9408716 Minus 



2996643 
5791501 
6984114 



7331404 
7464784 
7630897 
7630945 



24673-25170 

48292-48398,49564-49944 
20393-20767 
84065-84242 
54596-54777 



89392-89498,90358-90571 
194384-194645 

175426-175667 

19036-1 9401 , 1 9689-19849,1 9961-20102 
22365-22473 



142964-143260 

165035-165334,165420-166713 

174720-176016,175104-175406,175508-175813 

77162-77350,81338-81511 

166037-156210 

76723-77027,79317-79484 

27494-27642 



8569726 Plus 



403687 
403764 
403776 



8101229 Plus 



7708872 
7708960 
7381715 
7710671 



1414-1513,1624-1756 

142803-142922 
22733-23007 



404268 
404274 
404288 
404290 
404336 
404403 



9886577 Plus 

7159766 Plus 

8218045 Minus 

9711362 Plus 

9885189 Plus 

2769644 Plus 



13446-13645 

78218-78418,79571-79709 

41111-41281,46495-45716,47801-47910 

127319-127754 



104127-104318 

3512-3691 

36651-36813 

157951-158129 

72053-72238 



64835-64994 
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404516 
404538 
404594 
404639 
404653 
404676 
404684 



404881 
404894 
404939- 
404977 



405064 
405071 
406102 
405167 
405170 



405308 
405349 

405379 
405390 
405411 



405720 
405725 
405735 
405738 



8151803 

8247909 
9958262 
9796778 



6850447 Plus 



406322 
406360 
406367 
406397 
406399 
406434 
406467 



9558556 
4512267 
5923640 
4895155 
9797144 



7667810 
6758731 
7705124 
6768795 
8247789 
8272661 



7417725 
7534217 
9211754 
9212102 



7711305 
7711412 
8189273 



146359-146739 

114153-114322 

192748-192946 

16310-15510 

5779-14387 

164997-16523C 

66167-56342,58066^189,68891-69048,60452-60628 



110881-111020 

163217-153316,154043-154124,159185-159353,161290.161420,163544-163669,166127-166207,167654-167734 



16223-16319,16427-16513,16736-16859,16941-17075,17170-17287,17389-17529,18261-18357,18443-18578 

4913-5093,7310-746919472-9621,9951-10082 

66852-66081 

96066-96192 



142358-142546 

349-822 

81207-81416 

11115-11552 

120922-121296 

43796-43981,48246-48427,54141-54317 
37047-37198 

161475-161581,162930-163067 
12993C-130076 

34202-34361,35194-36336,46412-46475,45731-46968,4729647467,49549-49658,4979049904,60231-60342,63583- 

53667,54111-54279 
40778-41034,4138341573 



22332-22473,24333-24439 

94007-94177 

17503-17778,18021-18290 
123097-123260 



26662-27225 
53624-53759 
13409-13861 
106417-106621 



3460-3717 

4941049620 

74653-75173 



120621-120971 

14893-15148 

46271-46874 

17942-18163 

38116-38691 

30487-31058 

8044-8877 

38899-39369 

13136-13591 

20170-20611 

130230-130418 

7513-7673 

58313-58489 

127317-127454 



126417-126563, 
126036-125422 
177277-177384 
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TftBLF ?A' m GENFR IIP-RRGULATRD IN BLADDER Q^NCER 

Pke/; Unique Eos probeselldenlllier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD; Ungene number 
Unigene TitleiUnigene gene title 

R1: Ratio of mRNA expression in bladder tumors compared to normal bladder 



UnigenelD Unigene Tille 



418818 
412841 
421066 

444042 
421307 
421318 
421359 
459462 
430134 
433227 
421742 
417366 
444342 
451686 
454417 
458760 
422119 
422170 
441377 
445958 
434288 
435347 
453134 
444781 
436154 
436246 
436293 
433078 
438181 
443399 
437722 
457465 
409757 



441496 
435376 
427685 



437553 
459204 
439842 



443851 
447578 
445550 
454284 



AA228899 

AI751157 

AUQ76726 

R27299 

NM_004915 

BE539976 

U63973 

AK001589 

AA481396 

BE380149 

AB040923 

AW970004 

BE185289 

m 014398 

AA059246 

AI244459 



Hs,10 

HS.1Q 

Hs.10172 

Hs.10237 

Hs.103305 

HS.1035G1 

Hs.103816 

Hs.105167 

Hs.106223 

Hs.106e06 

Hs.107628 

Hs,1076 



Homo sapiens HUT11 protein mRlvIA, partial 
liypotlietical protein MGC113S2 
se 2, mitoc 



liypolhetical protein aJ10727 



AI791949 

AW273802 

BE326257 

AW1 89075 

AW014873 

AA032211 

NM 014400 

AA764950 

AW450963 

AI601188 

AW015188 

AW978608 

AA760881 

AW292947 

AW301344 

NIIL001898 

AA853978 

A1306616 

BE159984 

AA380177 

AW294603 

AW770956 

AI751124 

AF010268 

Al^1687g3 

Til 832 



AW298115 

AJ245671 

W92620 

AF034803 

AI829936 

AW194601 

AI040686 
AB011537 
AA280321 
AI393122 
AW449462 
AA912347 
AI242754 
AW297935 



trinucleotide repeat containing 9 
ferritin, ligtit polypeptide 
KIAAC590 gene product 



hypatltelical protein FU23047 



ESTs, Wealdy similar to JU0033 hypotheB 

DNA replication factor 

cystatinSN 

ESTs 

ESTs, Wealdy similar to KIAA0768 protein 
ESTs 

[lbulose-5-phosphate-3-epimerase 



Hs.122908 
Hs.123114 
Hs.1 24677 
Hs.1 25343 
Hs.125395 
Hs.125e45 
Hs.12703g 
Hs.127280 
Hs.127311 
Hs.127428 
Hs.1 2743 
Hs.127797 



ls.12844 EGF-like^omain, multiple 6 
ls.128666 
ls.12953 
^s.130497 

^s.13219 ESTs 

ls.132413 ESTs 

ls.132908 ESTs 

^s.133466 slit (Drosoplilla) homolcg 1 

^s.13392 tBltiering factor SEC34 

^s.134726 ESTs 

1s.1 34743 ESTs 

^s.i36585 ESTs, Weakly similarto JC5314 CDC28Axic 



138493 ESTs, Moderalelys/milarloALU7_HUMANA 

T71508 Hs.13861 ESTs, Weakly similar to T42383 probable 

AF077374 Hs.l39322 small proline-ricli protein 3 

Hs.139336 ATP-binding cassette, sub-family C (CRR 
— C(CFTR 



AF071202 
AW163564 
AA448718 



Hs.139336 
Hs.142375 
Hs.142605 
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427961 AW293165 Hs.143134 ESTs 

419888 AI243493 Hs. 144049 ESTs 

413943 AW294415 Hs.144687 Homo sapiens cDNARJ12981 is. clone MT 

445871 AI702901 Hs.1 45582 ESTs, Weakly siniilartoFOR4 MOUSE FORMI 

ls.145645 ESTs,Moderatelyslmil»loALU1_HUMANA 




ESTs, Weakly dmllariaZ195_HUMAN ZINC 
ESTs. Weakly similar to TQ8729 RING zinc 
MUF1 protein 

Homo sapiens ctone TaCIA00176 mRNA sequ 



AA960844 
W67730 
AF196478 
AA577730 
AA551104 
AL1 33990 
AA411880 



2108276AssDNA-b 



S.1 83752 microseminopFOtein, bela- 

KIAA0175 gene product 

Hs.184572 cell division cycle 2, G1 to Sand G2 to 

Hs,184727 ESTs 

Hs,186579 Homo sapiens, clone 1^MGE;4081483, mRNA 

Hs.187573 ESTs 

Hs.188401 annexinAlO 

Hs.ie8684 ESTs, Weakly similar to PC4259Mn 

Hs.18g048 ESTs. Moderately similar to ALUC_HUMAN I 

Hs.190642 ESTs 



140.859 
2.037 
0.018 



15.708 
207.938 
62.333 
32.959 
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428058 
431245 
453204 



441508 
453223 
5 5 442167 



Hs.191602 ESTs 

Hs.1 91687 ESTs 

Hs.191990 ESTs 

Hs.1 92371 down syndrome oriiical fegion protein OS 
■ no sapiens, SimilartoRIKENcDNA2010 

Hs.194293 ESTs, Wealdy similar to 154374 gene NF2 

protein tyrosine pliospliatase, re 
Hs.197484 ESTs, Weakly similar to JC4235 ring fng 
ttiyroid transcription factor 1 
gap junction protein, lieta 5 (connexin 3 




AA418ia9 

BE614410 

AW003578 

AI939885 

AW9B9408 

BE613183 

AW015203 

AW628325 

H18740 

AI984381 

AL044222 



AA256641 
AA490631 
BE513442 
BE540274 
D80400 



Hs.22604 
Hs.22630 
Hs,226351 
Hs.226414 
Hs,227767 
Hs.227g4S 
Hs.23017 
Hs.23044 
Hs.231872 
Hs.231926 ESTs 



ESTs, Weakly similar to SFR4_HU!*\N SPLIC 
hypothetical protein FLJ20121 
solute carrier family 1 5 (oligopeptide t 

ESTs 

CEGP1 protein 
small Inducible cytokine i 
lysosomal apyrase-like protein 
cofaclor required f " * ' 
ESTs 

ESTs, Weakly simll? to ALU8_HUMAN ALU S 
zinc finger protein 41 
serine (or cystine) proteinase Inhlblto 
Homo sapiens cOm FU22747 fis, clone K 
RAD51 (S. cerevlsiae) homolog (E coli Re 



Hs.232237 ESTs 



Hs,232327 ESTs 



ls.23248 
Hs,232521 
Hs.23255 

Hs',235331 

Hs!236533 

Hs,236894 

Hs.237a3 

Hs.238944 

Hs.239 

Hs.2393a8 

Hs,23960 

H5,241551 

Hs.24255 



hypothetical protein from EUROIMAGE 2005 



ESTs 

hypothetical protein FU10631 
forkhead box Ml 

Human ONA sequence from clone RPt-30' 
cyclin B1 

chloride channel, calcium activated, fam 
OKFZP434I1735 protein 



N32S60 Hs,2461 1 ESTs, Weakly similar to 154374 gene l>IF2 

AW500321 Hs.246766 Homo sapiens cDNAFLJ1236Crs. done HM 

NM 013936 Hs.247862 oKactoiy receptor, gmily 12, subfamily 

U5^83 Hs.248182 olfactory receptor, tamlly 1, subfamily 

AF097874 Hs.248226 caspase 14, apoptosls^elaled cysteine p 

■ ' anscrlptlon factor 19 (SC1) 

)onudeaseL(2r 



AW501465 



iL(2-,5--olfgoisQadenylale 
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AW177225 


Ha250158 










Hs.250618 


















AI133461 


































TOLUP ptotdn 








Hs.25482 




























BE1 60081 


Hs.256290 


S100 calcium-bindtng protein All [calglz 


















H5.25740 


ER01 (S. cerevfsiaeHike 






BEG47032 


Hs.257789 


















AI912498 




hypothetical piotein FLJ14996 










hypothetical piotdn FLJ10559 
















AL046988 




E^J^erately similar to ALU7 HUMANA 








Hs.268760 








AV654845 










BE019924 


H^271580 








Hs.271880 












integrin, alpha 2 (CD49B, alpha 2 subuni 










ESTs, Weakly similar to gonadotropin ind 






AA4a6293 




ESTs, Weal<ly similar to ALU3 HUMAN ALU S 






AA157632 


Hs. 272630 
















































AB0ia345 




KIAA0802 protein 


















































clone HQ0310 PRO0310p1 






AV658444 








450020 


AI680684 


Hs.282219 


















AW975920 


Hs.283361 










Hs 284227 








AF1 13018 


Hs.284302 


PR01621 protein 




























Hs.289015 


hypothetical protein MGC4171 










progesterone receptor 






AA293279 


H^29173 


hypothetical prolan FLJ20515 
















A^^2398 


H&292036 


ESTs, Weaidy similar to B34087 hypothetl 






BE386042 




ESTs! Weaidy similar to GGC1_HUMAN G Al^ 
















AF026944 


Hs.293797 








AA194055 


Hs.293858 












iiypothetical piotdn MGC13102 
















AH 86431 






















Hs.298623 








AI336329 


Hs.301519 


Homo sapiens cDNA FLJ12536 fs, clone NT 






















438366 


AA805760 


Hs!303567 


ESTs 


1.000 


452724 


R84810 


Hs,30464 


cyclin E2 


1.000 


429343 


AK000785 


Hs,307036 


Homo sapiens, Similar to epsin 3, clone 


0.494 


446466 


H38025 


Hs.308 


arrestin 3, relinai (X-arrestin) 


0.022 


430694 


AA810624 


Hs,30936 


ESTs, Weakly similar to H2BH_HUMAN HISTO 


15.744 


432789 


D26361 




KIAA0042 gene product 


0.302 




AW204069 


Hs!312716 


ESTs, Weakly similar to unnamed protein 


0.001 


453028 


AB006532 


Ha31442 


ReoQ protein-like 4 


13.392 


433091 


Y12642 


H5.3185 


lymphocyte antigen 6 complex, locus D 


2.766 


427122 




Hs.323910 


HER2 receptar tyrosine kinase (c-ert>b2. 


553.782 


453216 


AL13756B 


H5.32406 


Homo sapiens mRNA; oDNA DKFZp586G0321 (f 


84.115 


443247 


BE614387 


Hs.333893 


c-MyctaigetJP01 


79.385 


439632 


AW410714 


Hs.334437 


hypothetical protein MGC424a 


337.474 


431448 


AL137517 


Hs.334473 


hypothetical protein DKFZp56401278 


0.842 


411248 


AA551538 


Hs.334605 


Homo s^ns cDNA FU14408 lis, clone HE 


402.500 
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AW043909 


H5.334707 


aminoacylase 1 


191.179 


432842 


AWB74093 


Hs.334822 


hrpotheiicalpralelnMGC4485 


313462 . 


457292 


A1921270 


H6.334882 


hypothetical proleinFU1425l 


21.744 


451359 


H85334 


Hs.336623 


ESTs 


0.038 


440249 


AI246590 


Hs.337275 


ESTs 


a432 


4344B7 


AF143867 


Hs.337588 


ESTs, ModeratBly similar to S65657 alpha 


1.102 


447437 


U07225 


Hs.339 


purinergic receptor P2Y, G-proian coup! 


0483 


447519 


U46258 


Hs.339665 




1.032 


434192 


AW387314 


Hs.34371 


ESTs 


0.003 


453765 


BE279901 


Hs.35091 


hypothetical prolan FU10775 


0.056 


441020 


W79283 


Hs.35962 


ESTs 




463884 


AA355925 


Hs.38232 


KIAA0186gene product 


0,138 


453922 


AF053306 


Hs.36708 


budding uninhibited by benzimidazoles 1 


0.008 


453945 


NM 005171 


Hs.36908 


activating transcription factor 1 




447289 


AW247017 


Hs.38978 


melanoma antigen, family A, 3 


aoo2 


40762B 


U39196 


Hs.37169 


potassium Inwardly-rectl^lng channel, s 


aoo9 


423620 


N71320 




ESTs 


1.000 


436027 


Aia84053 


Hs.39972 


ESTs, Wealdy similar to 138588 reverse I 


a042 


407845 


AA426202 




Cbp;p300^nieraciing transaciivaicr, wit 


1.810 


443133 


AI033878 


Hs.41379 


ESTs 




434534 


H90477 




ESTs 


aoi3 


452934 


AA581322 


Hs.4213 


hypothetical prolan MGC16207 


3.679 


434952 


T10269 


Hs.4285 


Homo sapiens cDNA: FU22505 lis, done H 


2885 


432237 


AK001926 


H5.44U3 


poiybromo 1 


0010 


420900 


AL045633 


Hs.44269 


ESTs 


10.436 


408522 


AI541214 


Hs.46320 


Small prollne-rlch protein SPRK [human, 


a393 


435099 


AC004770 


H3.4756 




386.256 


431009 


BE149762 


Hs.48956 


gap JuncBon protein, beta 6 (connexin 3 


0922 


408947 


AL080093 


Hs.49117 


Homo sapiens mRNA; cDNA DKFZp564N1662 (f 


a003 


436847 


AIB63240 


Hs.49823 


ESTs 


175.910 


435854 


AJZ78120 


Hs.4996 


putative ankyrin-repeat containing proie 


Z584 


436291 


BE568452 


Hs.5101 


protein regulator of cytokinesis 1 


0610 


455506 


AA703584 


Hs.5105 


hypolheBcal protein FLJ10569 


aoo8 


436481 


AA379597 


Hs.5199 


HSPC150 protein similar io ubiquitin-con 


1.089 


409287 




Hs.52792 


Homo sapiens mRNA; cDNA DKFZp586l1823 (f 


16.910 


435047 




Hs.54973 


cadherin-IIke protein VR20 


0612 




AL359053 


Hs.57664 


Homo sapiens mRNA full length insert cDN 


5.938 




AA918317 


Hs.57987 


3-cell CLUymphoma 11B (dnc nngerpro 


1.000 


439482 




Hs.58089 


ESTs 


0118 


439605 


W79123 


Hs.sasei 


G protein-coupled receptor 87 


0095 


439706 


AW872527 


Hs.59761 


ESTs, Weakly similar to OAPI.HUIWN DEATH 


a055 


452240 


AI591147 


Hs.61232 


ESTs 


a221 


452316 


AA2984a4 




ESTs, Moderately similarto G786_HUMAN P 


Z595 


452747 




Hs.61460 


Ig superfamliy receptor LNIR 


3.677 


434876 




H3.51460 


Ig superfamliy receptor LNIR 


7.587 


444783 


AK001468 


Hs.62180 


anillin (DrosopbDa Scraps homolog), act 


a046 


438779 


NM 003787 




nucleolar protein 4 


a030 


439453 


BE264974 


Hs.6566 


thyroid hormone receptor interador 13 


1Z016 


440125 




Hs.56194 


ESTs ' 


aoo8 


461291 


R39288 


Hs.5702 


ESTs 


0012 


439953 


AW247529 


H3.5793 


platelet-aclnaiing factor acetylhydrola 


1.653 


440006 


AK000517 


HS.B844 


hypotheUcal protein FU20510 


a628 ■ 


437044 


AL035864 


Hs.59517 


CDMA for diffbrentlaily expressed C01 6 g 


140908 


418107 


R41726 


Hs.7284 


ESTs 


0146 


43632B 


BE085236 


Hs.75313 


aldo-keto reductase family 1 , member B1 


0649 


433675 


AW977653 


Hs.75319 


ribonucleollde reductase M2 polypeptide 


0237 


41441B 


AW409985 


H8.76084 


hypothetical protein MGC2721 


Z242 


414430 


AI346201 


Hs.76118 


ubiquitin carboxyl-lerminal esterase LI 


a202 


414682 


AL021154 


Hs.76884 


inhibilor of DNA binding 3, dominant neg 


2.318 


414807 


AI738616 


Hs.77348 


hydroxyprostaglandln dehydrogenxe 16-(N 


1.622 


414907 


X90725 


Hs.77597 


polo (Orosoph/a)-lilse kinase 


246.564 


451575 


AA767622 


Hs.78893 


KIAA0244 protein 


1.000 


400303 


AA242768 


Hs.79136 


UV-1 protein, estrogen regulated 


4.552 




U33632 


H5.7935I 


potassium channel, subfamily K member 1 




458921 


AI682088 


Hs.79375 


holocarboxylase synthetase (bialin-fprDp 


0246 


409235 


AA188827 


Hs.7988 


ESTs, Weakly similar to 138022 hypothefi 


7.249 


440371 


BE268550 


Hs,80449 


Homo sapiens, clone IMAGE'3535294, mRNA, 


0.792 


452732 


BE300078 


Hs.80449 


Homo sapiens, clone IIMGE:3535294, mRNA, 


360782 


417003 


AL038170 


H5.80756 


betaine-homocysleine methyllransferase 


6O690 


407584 




Hs.8173 


hypothelical protein FU10803 


a988 


417312 








402705 


417389 


BE2M964 


Hs!82045 


midkine (neuillB growth-promoing factor 


10.806 


428839 


AI7S7756 


HS.B2302 


Homo sapiens cDNA FU14814 lis. clone NT 


1.051 


438315 


R66795 


HS.82419 


ESTs 


0226 


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi 


2a260 


417933 


X02308 


HS.S2962 


thymii^lale synthetase 


221.090 


418067 


A1127958 


HS.B3393 


cystatin E/M 


2396 
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438086 


AA336519 














troponin 1, skeletal, fast 






M34455 










AA284166 
































Hs.85982 










Hs.86211 








AW242799 


Hs.86366 










Hs.87128 


hypothetical protein FLJ23309 






S70348 










AA587773 




Homo sapiens, Similar to Rl KEN cDNA 5830 




445060 




Hs.38808 






453450 


AW797627 


Hs.89474 








BE537383 


Hs.89739 








AF039241 




histone deacetylase 5 










DKFZP434G032 piotein 










Homo sapiens, clone IMAGE:4132043, mRNA, 






Y07828 














utH^uHln carrier protein E2-C 






A^8158 










AI249036 


Hs.94292 


hypothetical protein FIJ233ll 






























ESTs Moderately similar to S29539 tibos 






























BRCAI-interacting protein 1; BRCAI-assoc 






























ESTs, Weakly similar to unnamed protein 


























AW970622 




9b:EST382704 MAGE resequences, MAGK Homo 
















AB002446 




















AI905517 




gb;RC-Bf091-210199-105 BT091 Homo sapien 






































































gb:CM1-BT0368-061299-060-c09 BT0368 Homo 










gb:DKFZp761D2315j'1 761 (synonym: hamy2) 


a994 








gb:DKFZp761H0216j'1 761 (synonym: hany2} 




























































gb:HSC1JA121 normalized infant brain cDN 




















gb:IL2-BT0731-260400-076-F04BT0731 Homo 










gb'MRO-HT007S-021299-OD6-d07 H'n)075 Honw 










gb:nli30c04,s1 NCI CGAP Pr3 Homo sapiens 










gb:nt58f 10,x5 NCI CGAP Pr3 Homo SEplens 










gb'nx85c05.s1 NCf CGAP GCB1 Homo salens 










gb:PMO-Sf^0014-260400-002-b08 SN0014Homo 






















454789 


BE155314 




gb:QV0-HT0367-150200-114-d02HT0367Homo 


toco 


433005 


AW939074 




gb:QV1-DT0069-010200^57-c12 OT0069 Homo 


aoi3 


456380 


BE150188 




gb:QVl-HT0413-010200-059-gC5 HT04t3Homo 


0.249 


455650 


BE054555 




gb:RC1-BT0313-301 299-012-009 BT0313 Honr,o 


1.000 


436383 


BE065178 




gb:RC1-BT0314-020200-012-h01 BT0314 Homo 


1.000 


413100 






gb:RC1-BT0314-31030Q-015-bO9 BT0314 Homo 


271.372 


428436 


BE080180 




gb:RC4-BT0629-1 20200-01 1-b10 BT0629 Homo 


0.002 


455831 


BE144966 




gb:RC6-HT0187-201099-031-c04 HT0187 Homo 


0.011 


434414 






gb:lr34b07.x1 NCI CGAP Ov23 Homo sapiens 


293.654 


414221 


AW450g7g 




gb:UI-H-BI3-ala-a-12-aUl.s1 NCLCGAP_Su 


0.807 


409488 


AW402825 




gb:UI-HF-BK0-aaq-d-08O-Ul.r1 NIH_MGC 36 


0.965 


437938 


AI950087 




gbMiq05cO2.x1 NCI_CGAP_Kid12 Homo sapien 


2.952 


461385 


AA017S56 




gbze39liai.r1 Soares retina N2b4HR Homo 


7.341 


449325 


AA001162 




gb:ze48b06.r1 Scares retina N2b4HR Homo 


0.004 


413316 


W91931 




gb:zh47c01.r1 Scares Jelaljiver_spleen_ 


0.004 
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401335 
401555 
401760 

401781 
401961 
402239 

402424 
402777 
402778 
402837 



403142 
403297 
403537 
403657 
404136 
404249 
404875 

404983 



Pkey: Unique Eos probesel UBntiSer number 
CAT number Gene cluster number 
Accession; Genbank accession numbers 



421934 
424040 
428436 

431322 
432189 
433006 
433108 
434414 



434672 
436383 
437240 



1339278_1 
1349119.1 
1351251J 



1510393J 
1529407 1 
209339_1 



331543_1 
34281 9_1 
357346_1 



AW402825 BE544338 
AI267606AA121045AA126521 
BE01 9688 BE1 44460 

BE065208 BE065224 BE065168 BE0B5313 

BE068098 BE068119 BE068083 BE058088 BE068120 BE068155 BE058111 

W91931 Wg4979 BE081744 

Z43712 BE156729 BE156538 BE156731 BE156673 BE156539 BE156674 BE158430 BE156672 BE156675 BE156432 BE166541 
AW45097g AA1 36653 AA136655 AW41 9381 AA984358 AA492073 BE1 58945 AAa09054 AW238038 BEQ1 1212 BE01 1359 BE01 1 367 

BE011368 BE011362 BE011215 BE011365 BE011363 
C16195C16230C16211 C161B4C16251 



AA300625R16859R16860AW898335W24337 
AA334400AA334257AW966124 

BE080180 AW827313 AW231970 AA995028 AA42B584 AW87271 6 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131 AA528743 
AA552874 AA66475B AW063245 AI267634 AW070190 AW893483 AA770330 AA906928 AA906682 AA758746 AA551717 AW06331 1 AA429538 
AW970622 AA503009 AA502998 AA502989 AA502805 T92188 



AW939074 AW93g073 BE1 60476 AWg39938 AW939206 AW94001 2 AW939076 AA573577 AW750479 AA574383 AW970057 



AI798376 S46400AW811617AW811B16W00557 BE142245AW858232 AW861851 AW858362 AA232351 AA218567AA055556AW858231 

AW857641AW814172H66214AW81439BAF134164 AA243093AA1 73345 AA1 99942 AA223384AA227092AA227080T1 2379 AA0921 74 

T61139AA149776AA699829AW879188AW813567AW813538AI267168AA157718AA157719AA100472AA100774AA130756AA157705 

AA157730 AA1 5771 5 AA053524 AW849581 AW854566 C06254 AW882836 T92637 AW812621 AA206583 AA209204 BE1 55909 AA226824 

A1829309AWg919S7N66951 AA527374 H66215AA04S564AI69426SH60808AA149726AW19S620BE081333BE073424 AW817B62 

AW817705AW817703AW817559BE081531 H59570 

AF147340T51948T62029 

BE065178AJ227879 

AA747537 BE089068 BE089070 

AI960087N70208 1^97040 N36809AI308119AW967677 N35320AI251473 H59397AW971573 R97278 W01059AW967671 AA908698AA261875 
Ai820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AAB84922 BE328517 AI219788 AA884444 N92578 F13493 
AA927794 AI560251 AW87406a AL1 34043 AW235353 AA663345 AW008282 AA488964 AA2831 44 AI890387 AI950344 AI741 346 AI689062 
AA282915 AW1 02898 AI8721 93 AI763273AW1 73586 AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI639642 
AA642789AA856975AW505512AI961530AW629g70BE612881 AW276997 AW513601 AW512843AA044209AW856538 AA1 80009 AA337499 
AW961 101 AA251669 AA251874 AI819225 AW205862 A)683338 A1858509 AW276905 A)633006 AA972584 AA908741 AW072629 AW513996 
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AA293273 AA969759 N76628 N22388 H84729 H60062T92487 AI022058 M780419 M551005 W80701 AW613456 AI373032 AI554269 TO 
H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 

438990 46760 1 AF085890 H29949H 29856 

439780 47673_1 AL1 09688 R23665 R2657a 

440012 4B3290_1 AA861072 H02819 R25946 

443309 56604_1 AI821 874 AI821 868 AA630932 AA653897 AA650103 AI821131 AI821124 

444163 593658J AI126098 AM84746 AI148521 

449325 80480_1 AA001162 AA018950AA0175Q5 R84446 

451385 85787_1 AA017656 AA017374AA019761 

453823 982526J AL137967 BE064160BE064186 

453846 983043_1 AL1 57686 AL1 57590 

454789 1234742_1 BE156314 BE156316AW820750 

4551 70 1 256906_1 AW860972 AW862598 AW862599 AW860988 AW860983 AW860898 AW860926 AW860922 AW860g86 AW860984 AW860989 



BE091833BE091B74BE09ie71 

BE141 140 BE141 139 BE141 10S BE141 143 BE141 127 BE141202 BE141108 

BE144986BE144957BE144958 

Al557388BE1SB93a 

AI905517AI905455AI9054S2 



im 1. el al." refers to the publication entitled The DMA 



402239 
402305 
402424 
402777 
402778 
402837 



404136 
404249 
404876 
404917 
404983 
405238 
405364 
405531 
405601 
405621 



9588235 Plus 



9801324 
7341851 
4432779 
7249119 
2281075 
9665194 



NLposilion 

126234-126359.128060-128236 

15736-16352 

162520-162657 

83126-a3250,85320-85540,94719«287 

83215-a3435.83531-83656,83740-83901,84237-84393,a4955-85037,8629O-86814 

124054-124209 

38175-38304,4213342266 

40832-41362 

64925-65073 

126786-126948 



2013-2186,9570-9758,11136-11309,19429.19677,21210-21455,23368-23562,24342-24527,29132^9320 



142647-142771,145531-145762 



96588-96732,97722-97831 



5523811 



51178-51374,52000-52173 
51728-51836 

48325-48491,4913649252 
35602-35803 
147335-147935.14 
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Table 3A: Preferrsd thBrapeulfc targeb for bladder cancer 

Pkey; Unique Eos probssel IdBnllBer number 

ExVVscn: Exemplar Accession number, Genbani< accession number 

UrtlgenelD: Unigene number 

Unigene Tille:Unlgene gene tiUe 

R1: 901)1 peicentileQfbladdertumorAlsdlvldedbythe90th peicenllle of nonnal bladder sample Als 

R2: gotti percenllieafbladdertumorAls divided by the 90th percentile of normal body sample Als 



Pkey ExAccn 

421948 L42583 

439926 AW014a76 

413324 V00571 

421110 AJ260717 

417308 H60720 

431211 bl86849 

418406 X73501 

446619 AU07BB43 

433001 AF217513 

408243 Y00787 

417715 AW969587 

417720 AA205625 

423673 BE003054 

418007 M135D9 

413753 U17760 

441633 AW968644 

422168 AA5B5894 

407242 M18728 
405033 

449230 BE613348 

406685 M18728 

420159 AI672490 

415511 AI732617 

426028 NM_001110 

424008 R02740 

428651 AF1 96478 

402230 

452747 BE153855 

416065 BE267931 

428450 NM_014791 

418322 AA284155 



412610 
41B663 AKOOIIOO 



424834 AK001432 

431958 XB3629 

423725 AJ403108 
401780 

424308 AW975531 



ls.334309 
ls.137007 
1s,75294 



Hs.75517 
Hs,112242 
H5,11240a 



412140 , 
401781 

425234 AW1 52225 

432842 AW674093 

408380 AF1 23050 

449722 BE280074 

422809 AK001379 

400409 AF1 53341 

429113 D28235 

444371 BE540274 

443171 BE281128 

441362 BE614410 

439963 AW247529 

407137 T97307 

426088 AF038007 

443426 AF09B158 

414761 AU077228 

413063 AL035737 

421508 NM_004833 

443162 T49951 



Hs,184339 

Hs,84113 

Hs.74126 

Hs,41690 

Hs,76a88 

Hs.38178 

HS.1S3408 

Hs,2877 

Hs.132127 

Hs.154443 

Hs.82962 
H5,83484 
Hs,73625 



corticotropin releasing iiormons 



Hs. 182362 
Hs.17202a 
Hb, 137555 
Hs,188401 



clone HQ0310PROO310p1 



matrix melalloproteinase 12 (macropliage 
matrix melalloproteinase 1 (interstitial 
lamini(i,bela3(nicein(125l<D), l<alinin 
normal mucosa of esoptiagus specific 1 
SI 00 calcium-binding protein A7 (psorias 
gb:Human nonspecific crossreacling anilg 
C1002652';gi|544327|5p|Q04799|FMO5_RABrr 
melanoma cell adhesion molecule 
gb'.Human nonspecific crossreacling aniig 
Homo sapiens CDMA; FU21245fls, clone C 
ESTs 



NM_003105':Homo sapiens sortilin-relaled 
Taiget Exon 

Ig superfamily receptor LNIR 
proliferating cell nuclear antigen 
KIAA0175 gene product 
oyclin-dependentliinase inhibitor 3 (CDK 



hypothetical piolein MGC12702 
hypothetical piDieinFU23468 
Homo sapiens cDNA FU10570 fis. clone NT 
cadherin 3, type 1, P-cadherin (placenta 
hypothetical protein LOC57322 
NM.005557':Homo sapiens keratin 16(foca 
minlchromosome maintenance deficient (S. 
C12000586*:gi|B330167|dbi|BAA86477.1|(A 
thymidylate synthetase 
SRY (sex determining region Y)-box 4 



ke(rab 



Taiget Exon 
ESTs, Weakly similar to 138022 liypotheli 
tiypotlietical protein lv1GC4485 



putative G prolein-coupled receptor 
tiypotlietical protein FLJ10549 
Insulin-like growth factor 2 (somatomedi 
Homo sapiens winged hellx/fbrkhead trans 



Hs.239 
Hs.9030 
Hs.23044 
Hs.6793 



Hs.77256 
Hs.75184 
Hs.105115 



KM Ml 

TONDU 

RAD51 (S. cerevisiae) homolog (E coli Re 
plalelet-activaling factor acetylhydrola 
gb:/e531i05.sl Scares fetal liver spleen 
ATPase, Class I, type SB, member 1 



enhancer of zeste (Drosophila) homdog 2 
chiUnase 3-like 1 (carriage giycopiote 
absentin melanoma 2 
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40 
45 



60 
65 
70 
75 

















AAa04B98 










441495 


AW294603 












AF019225 


Hs,114309 


apolipoprolein L 






















Hs,295449 










AK000517 




hypolhelical prolan FLJ20510 










Hs.8375a 


CDC28 protein kinase 2 






400289 


























AW271106 








1.22 




NM 001898 








2.93 




AI393122 


Hs.134726 












HS.22C529 










AW411307 




CDC45 (cell division cyde 45, S.cerevis 




















BE616633 


Hs.170195 


bone morphogenelic protein 7 (osteogenic 


3.25 


2.46 








gap junction protein, t)eta 6 (connexin 3 


3.23 


2.88 






H3.260855 












protease Inhibitor 3, sk'irHlerived (SKAL 


ai8 


{54 




AI908165 




GATA-blnding protein 3 (T-cell rsceptor 


ai7 


5.44 




AL033527 


Hs!92137 


v-myc avian myelocytomatosis viral of?cog 


3.13 


15.65 




BE314524 


Hs.78776 






2.46 










3.11 


3.30 




AW1 03364 




Inhlbin, beta A (actlvin A, acSvin AB a 








AL035668 


Hs.73853 










A1127076 


Hs.306201 


hypothsllcal'pratein DKFZpS6401278 








AI249468 












AA291377 








14.95 




AV660033 


Hs.2066 




2,97 


2.10 




























AF261555 














Hs!l36348 






















AA628980 












AA256641 


H8.236894 










AW238299 




UL16 binding protein 2 




















AW248508 


Hs.279727 


















2,68 




AW351839 


Hs, 124660 


Homo sapiens cDNA; FU21763 fe, clone C 




2.46 




AW408164 


H5,249184 


transcription lactor 19 (SCI) 




































ESTs, Weakly similar to S64054 hypoiietl 








AF143867 


Hs.337588 


ESTs, Moderately similar to S65657 alpha 








AL038170 




























NMl003105*:Homo sapiens sortil'n-ielaled 


Z69 


13^45 












1.36 








hypothetical protein DKFZp56401 278 ^ 












ENSP00000231844*:Ecolropic vims irtsgra 




13.40 




AA551538 




Homo sapiens cDNA FLJ14408 fis, clone HE 




2.43 




NM_002153 


Ha. 155 109 


hydroxysterold (17-beta) dehydrogenase 2 


267 


2.68 




AC004770 








1.79 




NM 005982 


Hs.54416 












Ha, 222024 










AI274139 












All 86431 


Hs,296638 












Hg, -161566 




















417312 


AW888411 


Hs!250811 


leukemia-associated phosphoprotein p18 ( 


Z64 


l'31 


430157 


BE34870B 


Hs.278543 


ESTs 


2.63 


2.58 


436481 


AA379597 


Hs.5199 


HSPC1 50 protein similar to ubiquitin-con 


2.61 


1.93 


418686 


Z36830 


Hs.87268 




Z60 


1.62 


430486 


BE062109 


Hs.241551 


chloride channel, calcium activated, fam 


2.59 


2.87 


429276 


AF056085 


Hs.198612 


G prolein-ccupled receptor 51 


2.57 


3.89 


439738 


BE246502 




sema domain, immunoglobulin domain (Ig), 


2.57 


2.49 


409632 


W74001 


Hs!55279 


serine (or cysteine) proteinase inhibilo 


2.66 


1.43 


414812 


X72755 


Hs.77367 


monokine induced by gamma interferon 


2.54 


3.10 


461668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


2.51 




421379 


Y15221 


Hs.103982 


small inducible cytokine subfemily B (Cy 


2.60 


191 


414416 


AW409985 


Hs.76084 


hypolhelical protein MGC2721 


2.49 


1.78 


429612 


AF062649 


Hs.252587 


pituitary tumor-transforming 1 


2.49 


1.80 


416658 


U03272 


Hs.79432 


fibilllln 2 (congenital contractural ara 


2.49 


3.46 


437553 


AI829935 


Hs.130497 


ESTs, Weakly sMar to MAT8_HUMAN CHLOR 


2.48 


2.36 
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418941 AA452970 

414807 AI738616 

450983 AA305384 

407788 BE5149a2 

449019 AI949095 

417366 BE18523g 

420370 Y13645 

408000 L11690 
406399 

435563 AF210317 

442117 AW664964 

436246 AW450963 

433078 AW015188 

424012 AW368377 

411263 BE297B02 

432829 W60377 

415025 AW207091 

436293 AI601188 

415989 AI267700 

418067 All 27958 

436291 BE568452 

422278 AF072873 

428479 Y00272 

443247 BE614387 

424364 AW383226 
405932 

419741 NM.007019 

423271 W47225 
402305 

421064 AI245432 

427747 AW411425 

437181 AI306616 

431890 X17033 

433437 U20535 

407581 R48402 
400845 

419359 AL043202 

418526 BEfl19020 

448045 AJ297436 

453459 BE047032 

419183 U606S9 

436251 BE615065 

445911 AI985987 

420876 AA918425 

438817 AI023799 

434293 NM_004446 

422765 AW409701 

418216 AA662240 

437915 AI637993 

409420 Z15008 

431441 UB1961 
400773 

414937 AA524394 

424587 J05070 



H3.239527 E1B-55kDa-as 



441801 AW242799 

439780 AL109688 

452732 BE300078 

434876 AF160477 

430152 AB001325 

453134 AA032211 

412719 AW016610 

442577 AA292998 

409402 AF208234 

414774 X02419 

439318 AW837046 

447334 AA516032 

432015 AL157504 

429002 AW248439 

410553 AW016824 

420783 AI659838 

407811 AW190902 

448988 Y09763 

400303 AA242758 



H5.69380 

HS72307 
H3,120910 



Hs,334562 
Hs.333893 
Hs.201189 



Hs.90073 

Hs.85838 

Hs.20166 

HS.2577B9 

H8.89663 

Hs.2965a5 

Hs.145645 

H3.177744 

Hs.ie3242 

Hs,3796 

Hs,1578 

Hs.283099 

H8.202312 

Hs,54451 

Hs,2794 

Hs.294022 
Hs.161738 
Hs,11223 
H5,339 



ER01 (S.ce 

SI 00 calcium-binding protein A2 
ESTs, Weakly similar to T22341 hypolheU 
small prolifie-rich prolein 1B (cornilin) 
uroplakin 2 

bullous pemphigoid antigen 1 (230/240kO) 
NM_003122':Homo sapiens serine protease 
solute carrier family 2 (facilitated glu 
ESTs; hypottietlcal protein for ISMGE;447 
ESTs 

Homo sapiens cONA FLJ12231 lis, done MA 
tumor protein 63 kDa with strong homolog 
kinesin-IIke 8 (mitotic centromere-assoc 



cyslatin E/M 

prolein regulator of cytokinesis 1 
frizzled (Oroscpliila) iiomoiog S 
cell division cycle 2, G1 to S and G2 to 
c-Myc target JP01 

ESTs, Weakly similar to G01763 atrophin- 
C15000305:gi|3806122|gb|AAC69198.1|(AF0 
ubiquitin carrier pmlem E2-C 



serinerthreonine kinase 12 

ESTs, Weakly similar to KIAA0758 protein 

inlegrin, alpha 2 (C049B, alpha 2 subuni 

caspase 6, apoptosis-related cysteine pr 

P3ECSL 

N»4_0031 05*:Homo sapiens sortilin-related 
chromosome segregallon 1 (/east honulog) 
solute carrier family 16 (monocarlxw/lic 
prostate stem cell antigen 
ESTs 

cytochrome P450, subfamily XXIV (vitamin 

nucleolar protein (KKE/D repeal) 

ESTs, Moderately similar to ALU1_HUMAN A 



Hs.61460 
Hs.234642 
Hs,1 18493 
Hs.816 



Hs.6527 
Hs.91109 
Hs.159115 



Hs.99923 
Hs.400g8 
Hs.22785 
Hs.79136 



AF15q14 protein 
Homo sapiens clone N11 igTera2D1 tef 
laminin, gamma 2 (nicein (lOOkD), kalln 
sodium channel, nonvoltage-gated 1 < 



hypothetical protein FU14960 

matrix melailoproleinase 9 (gelatinase 
isocitrate dehydrogenase 1 (MAOP), s 
purinergic receptor P2Y, G-protein ooi 
EGF-like-domaIn, multiple 6 



Hs,37129 sodium channel, 
ESTs 



nRNAyi length Insert 
Homo sapiens, clone IMAGE:3535294, mRNA, 

Ig superfamily receptor LNIR 

aquaporin 3 



cystatin B (slefin B) 
plasminogen activator. Urokinase 
G proiein-coupled receptor 56 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586O0724 (f 
junction plakogk)bin 
hypothetical protein MGC14128 
lecHn, galactoside-Wnding, soluble. 7 
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AI219207 




hypolhefioal protein FLJ13459 


1.92 


Z77 




AL1 34847 








1.21 




BE563085 










428928 


BE409838 












AA641726 




hypothetical protein l\AGG4171 






400846 






sortilin-related receptor, L(DLR dass) 




193 


417409 


BE272506 












HU 014400 








116 


















Hs.89603 










NM_01S361 








2,01 




AI133461 




insulin-lllte growth factor 2 (somatomedi 




1.87 


430168 


AW968343 




DKFZP434i1735 protein 








AK001763 




hvpoltielical prolein FU10901 




177 








NM_025206*:HonD sspiers hypothetical pro 


185 






AJ271216 


Hs.22880 


dipepiidylpeplidase III 


1.85 


1.59 


















Hs.89414 


chemoldne [C-X-C moUl), receptor 4 (fus 




1.03 




AW410976 






i84 


1.64 






Hs.273330 


Homo sapiens, olons lliMGE:3544662, mRNA, 




1.74 


462934 


Mssisa 


H5!4213 


hypothetical prolein MGC1 5207 


182 


1.84 




NM_002204 


Hs.265829 


integrin, aiptia 3 (antigen CD49C, alpha 


1.82 


1.83 




BE270447 




ubiquitin carrier protein 




1.74 
















AW971155 


Hs.293902 


ESTs, Wealily similar la I3HUSS protein d 




a67 


















Hs.15830 


hypothetical protein FLJ12691 


180 


aoo 


408522 


AI541214 


HS45320 


Small proline-rich protein SPRK [human, 


1.80 


1.02 


428330 


L22624 


Hs.2256 


matrix melalloproteinase 7 (matrilysin. 


1.80 


2.26 




BE279383 




plakophilin 3 








AW419196 




hypothetical prclein FLJ13782 




5^9 




NM_002497 


Hs.153704 










AK001504 












BE001836 




ESTs, Weal(lysimilartodJ365012.1 [H.sa 








AL035864 








8.80 


























AW378065 






173 


199 




AA448542 




G antigen 78 


173 


8.65 








Homo sapiens mRt>lA; cDNA DKFZp434K0322 (f 








X90725 






172 


165 






Hs.347524 


solute carrier family 7 (cationlc amino 








AI471630 




KIAA0144 gene product 


171 


162 






Hs.11 6206 




1.71 


5.52 




AK001888 


Hs.i 89095 


^niiar lo SALLHsal (DposophilaHike 


1.71 


8.55 




NM_014398 


HS.10B87 


similar to lysoscme-associaied membrane 


1.71 


8.55 




AA321649 








8.55 






Hs]272499 


shcri-chain alcchd dehydrogenase fanilly 


171 


13.98 




AK000614 


Hs.i 8791 


hypoihelical pioleln FU20607 


1.70 


1.84 




BE391635 




transgslln 2 


1.69 


1.61 




A1199268 


Hs.19322 


Homo sapiens. Similar to RIKEN cDNA 2010 




8.45 




AF017307 


Hs.l 66096 


E74-li1<e factor 3 [ets domain transcript 


169 


1.60 












207 




AA298484 


Hs.61265 


ESTs, Moderately simlar to G786_HUMAN P 




0.70 




AI591147 












AA1 94055 














Hs.i 081 06 










BE001596 












BE250127 




CDC20 (cell division cyde 20, S. cerevi 


















423161 


ALW227 


Hs.124776 


downstream of cadherin 6 (by 3.3kb) 


163 


181 


453968 


AA847843 


Hs.62711 


High mobility group (nonhistone chromoso 


162 


1.51 


402777 






C1002652*:gi|544327|sp|Q04799|FMO5_RABIT 


1.62 


2.33 


436569 


BE439539 


Hs.279837 


glutathione S-lransferase M2 (muscle) 


1.62 


2.16 


417516 


L24203 


Hs.82237 


ataxia-telangiectasia group D-associated 


1.62 


1.27 


413385 




Hs.840 


indoleamine-pyrrole 2,3 dioxygenase 


1.61 


2.05 


410407 


X66839 


Hs.63287 


carbonic anhydrase IX 


160 


1.78 


450635 


AW403954 


Hs.25237 


mesenchymal stem cell prolein DSCD76 




1.63 


437016 


AU076916 


Hs.5398 


guanine monphosphate synthetase 


159 


1.50 


461982 




Hs.27373 


Homo sapiens mRNA; cDNA DKFZp56401763 (f 


1.58 


1.92 


422247 


U18244 


Hs.113502 


solute carrier family 1 (high affinity a 


1.57 




408908 


BE296227 


HS.250B22 


senne/threonine kinase IS 


1.56 


7!80 


433159 


AB035898 


HS.1505B7 


klnesln-llka prolein 2 


156 


7.80 


443211 


AI128388 


HS.143B55 


ESTs 


1.56 


7.80 


409893 


AW247090 


Hs.57101 




1.55 





158 



wo 03/003906 



PCT/LS02/21338 



10 
15 



426900 AW153564 

421066 AU076725 

413804 T64682 

418641 BE243136 

444783 AK001468 

414035 Y00630 

418543 NM.005329 

429211 AF052693 
402260 

424264 D80400 

433091 Y12642 

422164 NM_014312 

444163 AH 26098 

452304 AA025386 

446182 AW189787 

408930 AA146721 

429359 W00482 
406467 

424244 AV647184 

422094 AF1 29535 

431322 AW970622 

425247 NM_005940 

424420 BE614743 

421817 AF146074 

422119 AI277829 

418729 AB028449 

418399 AF131781 

453028 AB006532 

431369 BE184455 

453321 AI9a4381 

421478 AI5a3243 

425726 AF086aO8 

406906 Z25424 

429413 NM 014058 

418678 NM 001327 

431840 AA534908 

417433 BE270266 

422397 AJ2233SS 
403903 

425721 AC002115 

413943 AW294416 

431515 NM_012152 

443859 NIA.013409 

419743 AW40g762 

422330 D30783 

420281 AI623693 

424717 H03764 

440304 BE159984 

422170 A1791949 

417599 AA204688 

411874 AA096106 

449961 AW265634 

418606 AA084248 

432239 X81334 

453216 AL137566 

423634 AW959908 

423017 AW178761 

453365 AA035211 

439239 AI031540 

450684 AA872805 

425650 NU_001944 

406974 M57293 

410348 AW1B2663 

412661 N32860 

419121 AA374372 

426320 W47595 

426968 U07616 

432097 X51730 

452401 NII/L007115 

453389 BE273648 

419078 M93119 

430378 Z29572 

428182 BE386042 

451844 T61430 

415178 D80503 



H5.86947 
Hs,62180 
Hs,75716 



Hs,23938a 
Hs!l12377 



Hs.111862 
Hs,87889 
Hs.84753 
Hs.31442 
Hs.251754 
Hs.232521 
Hs.9725a 
Hs.159330 



n (Drosophila Scraps homolog), act 
serine (or cysteine) proteinase ititiibilo 
hyaluronan synthase 3 
gapiunctlon protein, beta 5 (connexin 3 
NM_001436':Horno sapiens fibrillarin (FBL 
Human DNA sequence from clone RP1-304B14 
lymphocyte antigen 6 complex, locus D 
oortic al thymocyte fsceplor (X, laevfe 
gb:qc54g07.x1 SoarBS_plaoentaJto9week5_ 
ESTs, Weakly similar to S10590 cysteine 
ESTs 

hyMli*alp«)temFU21588 ^ 

hy^etel protein hCLA-lso 

F-box only protein 5 

gb;EST382704 MAGE resequences, MAGK Homo 1 

Target Exon 

matrix metalloprolelnase 1 1 (stromelysln 
prostaglandin E synthase 
ATP-binding cassette, sub-family C (CFTR 
KIAA0590 gene product 
helicaso-moi 

hypothetical protein FLJ 12442 
RecQ protein-fte 4 

secrolory leul<ocytc arclease inhibi'.cr ( 
ESTs 

ESTs, Moderately similar to S29539 ribos 



Hs.236331 
Hs.25333 
Hs.1925 



Hs.95469 

Hs.24611 

Hs.89626 

Hs.169300 

Hs.173034 

Hs.2905 

Hs.29352 

Hs.32963 

Hs.89584 

Hs.2556 

Hs.293317 



DESC1 protein 

oancer/testis antigen (NY-ESO-1) 
POU domain, class 5, transcription fscb 
5T4 oncofetal trophoblast glycoprotein 
Homo sapiens cDNA: FU22496 fis, clone H 
C5001632':gl|10645306|gb|AAG21430.1|AC00 



Homo sapiens cDNA FU12981 Us, clone ^^^ 



Homo sapiens clone 24416 mRNA sequence 




G protein-coupled receptor 39 
matrix metalloprolelnase 13 (collagenase 
Homo sapiens mRNA; cQNA DKF2p586G0321 (f 
heparin-binding growth factor binding pr 
serine [or cysteine) proteinase inhibito 
S0X7 SRY (sex determining region Y)-lx)x 
ESTs 

interleul<in 1 receptor, type II 
desmoglein 3 (pemphigus vulgaris antigen 
Target Exon 

gb:Human parathyroid hormone-related pep 
ESTs 

ESTs, Weakly similarto 154374 gene NF2 



ir,bela2 

amphiphysin (SW-Mann syndrome with br 
progesterone receptor 
tumor necrosis factor, alpha-Induced pro 
oadherin 6, type 2, K-cadheiin (fetal ki 



Hs.322850 ESTs 



tumor necrosis factor receptor superfami 
ESTs, Weakly similar to GGC1_HUMAN G ANT 
gb:ycOBa03,s1 SIratagene lung (937210) H 
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417791 AW965339 

421373 AA808229 

427356 AW023482 

421070 AA283185 



416661 AA634543 

418738 AW388633 

412723 AA648459 
404B77 

443054 A1745185 

403047 

406434 

412530 AA766268 

433365 AF026944 

427666 AI791495 

429504 X99133 

431474 AL133990 

411880 AW872477 

414221 AW450979 

444649 AW207523 

456034 AW450979 

414521 D28124 

439569 AW602166 
AI204995 



432222 
407846 
457292 



416225 
400288 
421218 



highly expressed in cancer, rich in lei 



ESTs 



Hs.266273 
Hs.293797 
Hs.1 80142 



ESTs, Weakly similar to 138022 hypothetl 
hypothetical protein FLJ10339 
ENSP00000251056*;Plasma membrane calcium 
ring finger protein 
IGF-II mRNA-binding protein 3 

hypothetolpSA'F301222 
N^L005365:Homo sapiens melanoma antigen,, 
yes-associated protein 65 l<Da 
NM.005656*:Homo sapiens transmembrane pr 
NM.030579*:Homo sapiens cytochrome b5 ou 
hypothetical protein FU13346 



AI921270 

BE041395 

AI204995 

NM_002205 

M21305 

AA577730 



gb;UI-H-B13-ala-a-12-0-Ul.s1 NCI_CGAP_Su 
neuroblastoma, suppression of tumorigeni 
CEGP1 protein 

gb:an03c03.x1 Stratagene schizo brain S1 
Cbp/p300Hnteracting transacBvator, wit 
hypothetical protein FU14251 
ESTs, Weakly similar to unknown protein 
gb:an03c03.x1 Stratagene schizo brain S1 
integrin, alphas (libronectin receptor, 
gb:Human alpha sat A and satellite 3 
ESTs, Weakly similar to PC4259 fenitin 
integrin, alphas (fbronectin receptor. 



Pkey: Unique Eos probesel identifier ni 
CAT number; Gene cluster number 




AW872477BE088101 T0 



AW450979AA136653AA136656AW419381AA984358AA492073BE168945AAB09054AW238C38 BE011212BEC11359 BE011367 
BE011368 BE011362BE011215 BE011365 BE011363 

AI267700 A1720344 AA191424 AI023543 AI'',69 6 33 AA1 72056 AW958466 AA172236 AW953397 AA355086 
BE041395 AA49f826 AA621946 AA71 5980 AA633 1 02 
AW970622 AA503009 AA502998 AA5029B9 AA5026C5 T92188 
AI204995 AW827539 AW969908 AW440776 AA528756 
AA628980 All 26603 BE504035 
AL1 09688 R23665 R26678 
AI1260g8 All 84746 AI148521 

AW189787 AI215430 AW268499 AW205930A1392907 BE093017 BE093019 BE093010 
T61430AI820646AIB21336 

AW450979AA136663 AA136656 AW419381 AA984358AA492073 BE1 68946 AA809054AW238038 BE011212BE011359 BE011367 
BE011368 BEQ11362 BE01121S BE011365 BE011363 

TABLE 3C . . . 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identllier (Gl) numl 

sequence of human chromosome 22." Dunham I. et al.. Nature (1999) 402:489495, 
Strand; Indicates DNA strand from which exons were predfcted. 
NLposHion ; Indicates nucleotide positions of predicted exons. 
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400B46 
401093 
401747 

401760 
401780 
401781 
402075 
402230 
402239 



22335-23166 

i-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-1312S8,131866- 

31-132575,133580-134011 

83126-83250,85320-85540,94719-95287 

28397-28617,28920-290453135-29296,29411-29567,29705-29787,30224-30573 
83215-83435,83531-83656,83740^3901,84237-84393,84966-85037,86290-86814 
121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 

29782-29932 

38175-38304,42133-42266 

113765-113910,115653-115765,116808-116940 

40832-41362 



403047 3540153 

403381 9438267 

403903 7710671 

404403 7272157 

404440 7528051 

404875 9801324 

404877 1519284 

404977 3738341 

405932 7767812 



1095-2107 

142358-142546 

123525-123713 

38115-38691 

63448-63554 

17803-17931 

182212-182958 
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TABLE 4A: Preferred dianrosllcs for bladder cancer 

Pkey: Unique Eos probesel idenlifler number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD; Unigene number 

Unlgene Title; Unigene gene title 

R1 : 80th percentile of muscle-invasive bladder tumor (stage T2-T4) Als divided by the BCtli pexenti e of exoptiylic notvinvasive carcinoma (stage Ta) Ais 

R2 90tii percentile of bladder tumor Als minus background divided by 90lli peroentile of norma! body sample Als minus background, wliere background equals 

the 15th percentile of all sample Als 

R3 90th percentile of bladder tumor Als divided by the 90tli percentile of nomial body sample Als . 

































AU076643 


















ESTs, Weakly similar to S64054 hypotheti 






































N[yL002984 
















Hs'234734 














H5.83169 


matrix metalloproteinase 1 0nterstitlal 














protease inhibitor 3, skin-derived (SKAL 














small inducible cytokine subfamily A [Cy 










































gb:Homo sapiens (clone WR4.12VL) anti-th 






















































































































































































































































431639 




nS.266175 












AAj57185 






















































407939 










































412420 


AL035668 


Hs.73853 


bone morphogenetio protein 2 


1.31 


1.63 


2.22 


423217 


Nf«L000094 




collagen, 1/pe VII, alpha t (epWermolys 


1.27 


200 


1.67 


424206 


NNi003734 


HS198241 


amine oxidase, copper containing 3 (vase 


1.25 


0.16 


0.30 


415138 


C18356 


Hs,2959« 


tissue factor pathway inhibitor 2 


1.23 


1.70 


4.34 


424479 


AF064238 


Hs.149098 


smoothelin 


1.19 


0.27 




445537 


AJ245671 


Hs,12844 


EGF-like-domain, multiple 6 


1.17 




2^93 


417079 


U65590 


Hs.81134 


interleukln 1 receptor antagonist 


1.16 


0.82 


0.80 


421634 




Hs.106283 


hypothetical protein FLJ10262 


1.16 






439569 


AW602166 




CEGP1 protein 




2m 


2^42 


431346 


AA371059 


Hs!251636 


ubiquitin specific protease 3 


1.10 


1.64 


1.52 


448901 




Hs,22505 


hypothetical protein FLJ1Q159 




0.31 


0.31 


450983 


AA305384 




ER01 (S. cerevisiae)-like 


t03 


1.51 


1.42 


422424 




Hs!296638 


prostate differentiation factor 


1.02 


2.77 


2.68 


458781 


AI444821 




ESTs, Weakly similar to MPP3_HUMAN MAGUK 


1.00 


1.64 


5.45 


445413 


AA151342 


Hs!l2677 


CGI-147 protein 


1.00 




5.20 


432350 


NM_005865 


Hs.274407 


protease, serine, 1 6 (thymus) 


1.00 


1.55 


4.30 


403106 






C8000064*:gil10432393|emb|CAC10283.1|(A 


1.00 


1.48 


4.24 


402075 
404860 






ENSP00000251 056":Plasma membrane calcium 


1.00 


1.67 


3.95 






C1003394*:gi|12314272|emb|CAC00591,1l(A 


1.00 


1.40 


3.90 


434037 






WW domain-containing oxldoreduclase 


1.00 


1.58 


3.70 


405738 






CX000390':gil6014646lgb|AAF01438.1lAF187 




1.36 


2.95 


427585 


D31152 


H5.179729 


collagen, type X, alpha 1 (Schmki metaph 


too 


1.45 


1.60 


439898 
452567 


AW50S514 


Hs.209561 


KIAA1715 protein 


1.00 


1.28 


1.59 


D87120 


Hs.2g882 


prBdIcted osteoblast protein 


1.00 


1.10 


1.31 


401271 






C9000SSg*:gi|12314ig5|emb|CAB99338.1|(A 


1.00 


2.12 


1.00 


411339 


BE164598 


Hs.274251 


hypothetical protein FU20375; KIAAi797 


1.00 


2.05 


1.00 
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403005 






C21000027*:gil1817556|dbj|BAA1367Z1|(D 


1.00 


1.89 


1.00 


431146 


Z83850 




Human DNA sequence Irom PAC 82J 11 and co 


1.00 


1.89 


1.00 


434939 


AF161422 


Hs.306567 


Homo sapiens HSPC304 mRNA, partial cds 


1.00 


1.86 


1.00 


431753 


X76029 


Hs.2841 


neuromedin U 


1.00 


1.82 


1.00 


419121 


AA374372 


Hs.89626 


parathyroid hamione-like hormone 






1.00 


435505 


AF200492 


Hs.211238 






1^67 


1.00 


452401 
406397 


NM_007115 


Hs.29352 


tumor necrosis factor, alpha-induced pro 
C16001447':gil12053709|emblCAC20419.11( 


too 


1.67 


1.00 
1.00 


4044B8 






m 030g58':Honio sapiens organic anion tr 


1.00 




1.00 


441206 
407853 


BE552314 


Hs.131823 
Hs.40499 


ESTs, Weakly similarto TEIW HUMAN tH.sa 
dickkopf (Xenopus laevis) homolog 1 


1.00 
f.OO 


1.49 
1.47 


1.00 


446119 


D29527 


Hs.290931 


ESTs 


1.00 


1.47 




406471 






Target Exon 


1.00 




1.00 


402110 
407911 
404829 


AF104922 


H3.41565 


C180OO178:g|11990779|emblCAC19649.1|(A 
growth differentiation factor 8 
C1002937':gi|7499208|pir|[r20993hypothe 


1.00 
1.00 


1.42 
1.37 


1.00 
1.00 


421925 


S80310 


Hs.109620 


acidic epididymal glycoprotein-like 1 




1.25 


1.00 


406076 
456522 


AL390179 


Hs.13755 


Homo sapiens mRNA; cDNA DKFZp547P134 (fr 
f-box and WD-40 domain protein 2 


1.00 
1.00 


1.19 
1.09 


1.00 
1.00 


415018 
409357 


AW138239 
M73628 


Hs.78977 
H3.54415 


proproteln convertase subtilisinfkexin 1 
casein, kappa 


1.00 


1.03 




436684 


AW976319 


Hs.94806 


ATP-binding cassette, sub-family A (ABC1 


1.00 




0.84 


402522 


BE1 52396 


Hs.21590 


hypothetical protein DKFZp564O0523 
C1000568':gi|12697965ldbjiBAB21801.1|(A 


1.00 
1.00 


0.B0 


0.67 


405735 






ENSP00000252164*:KIAA1578 protein (Fragm 


1.00 


0.86 


0.56 


4O190S 






ENSP00000252232*:Sten)l regulatory eleme 


1.00 


0.65 


0.52 


404152 






C6000931':gi|9558454|db]|BAB0339ai| (AB 


1.00 


0.58 


asi 


418693 


AI750878 


Hs.87409 




1.00 


0.85 


0.51 


451375 


A1792066 


Hs.283902 


Homo sapiens BAC clone RP11-4B1 J1 3 from 


1.00 


0.46 


0.38 


430132 


AA204686 


Hs.234149 


hypolhelioal protein FU20647 


1.00 


0.84 


0.33 


456983 


AI081687 


Hs.11355 


thymopoietin 


1.00 


0.61 


0.29 


438581 


AW384815 


Hs. 149208 


KIAA1 555 protein 


1.00 


0.60 


0.28 


409038 


T97490 


Hs.60002 


small Inducible cytokine subfamily A (Cy 


1.00 


0.39 


0.19 


409196 


NM.001874 


Hs.334873 


carboxypeplidase M 


1.00 


0.43 


0.13 


410023 


AB017169 


Hs.57929 


slit (Drosophila) homolog 3 


1.00 


0.30 


0.12 


420874 


m 000055 


H3.1327 


bulyrylcholineslerase 


1.00 


0.30 


0.08 


415165 


AWB87604 


Hs.78065 


complement component? 


1.00 


0.08 


0.06 


425545 


N98529 


Hs. 168295 


Homo sapiens, clone MGC12401, mR^I^ com 


1,00 


0.10 




448256 


BE614149 


Hs.20814 


CGI -27 protein 


0.96 


1.32 


1.66 


417389 


BE260964 


Hs.82045 


midkine (neurite growth-promoting factor 


0.95 


3.15 


2.34 


403214 






I^IM_016232':Homo sapiens interleukin 1 re 


0.94 




2.51 


414799 


A1762416 


Hs.77326 


insulinlike growth factor binding prote 


0.92 


1.87 




406665 


U22961 


Hs.184411 


albumin 


0.92 






401519 






C15000476':gi|12737279lrellXP_012163.1| 


0.86 


1.46 


3.44 


417501 


AL041219 


H3.82222 




0.87 




0.60 


409532 




Hs.55279 


serine (or cysteine) piDleinase inhibito 


0.85 


1.38 


1.43 


405494 






C2001837-:gill2697903|dbi|BAB21770.1|(A 


0.83 




4.65 


444171 


AB01fl249 


H3. 10458 


small inducible cytokine subfamily A (Cy 


0.80 


0.91 


0.91 


439706 


AW872527 


Hs.59761 


ESTs, Weakly similarto [)AP1_HUIlMN DEATH 


0.79 


0.58 




436396 


AI683487 


Hs.1 52213 


wingless-type MMTV integration sitefaml 


0.77 




2.37 


426716 


NM_006379 


H3.171921 


ssma domain, immunoglobulin domain (Ig), 


0.75 


1.13 


1.18 






Hs.251664 


insulin-like growth factor 2 (somalomedi 








413753 


U17760 


Hs.75517 


laminin, beta 3 (nicein (125kD), kalinin 




170 


4.96 


426322 


J05068 


Hs.2012 


Iranscobalarrin 1 (vitanrjn B1 2 binding pr 


ae? 


1.50 


1.36 


426514 


BE616633 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


0.56 


2.05 


Z46 


422232 


AF0ig225 


Hs.1 14309 


apollpoproteinL 


0.55 


3.91 


3.92 


409757 


NII/L001898 


H3.123114 


cystatinSN 


0.53 


2.72 


Z93 


427450 


AB014526 


Hs.178121 


KIAA0626 gene product 


0.52 


1.34 


1.97 


414555 


IM98569 




phospholipaseA2, group IIA (platelets, 


0.50 


1.04 


1.05 


423774 


L39054 


H3!l702 


interleukin 9 receptor 


0.49 


2.81 


6.46 


404977 






Insulin-like growlh factor 2 (somatomedi 


0.28 


5.78 


5.17 


428336 


AA503115 


Hs.1 83752 


microseminoprotein, beta- 


0.21 


1.47 


1.56 


451668 


Z43948 


Hs.326444 


cartilage ackiic protein 1 


0.18 


4.05 


3.60 


428651 


AF196478 


Hs.1 88401 


annexInAlO 


0.17 


5.14 


27.75 


421110 


AJ250717 


Hs.1 355 


calhepsin E 


0.12 


5.49 


45.35 



TABLE 48 

Pkey: Unique Eos probeset IdenliTier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pf(ey CAT Number Accession 

431146 32854J Z83850AA459717AW965384AA333635 
434037 3791 8J AF1 16601 All 1 0691 AF063566 
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TABLE 4C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbenk Idenlirier (Gl) numbers. 'Dunham I. et al.' refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et al, Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted, 
Ntj)ositlon: Indicates nucleotide positions of predicted exons. 



401271 
401519 
401905 
402075 
402110 



404977 
405494 
405735 
405738 
406076 
406397 
406471 



Ref 

9797373 
6649315 
8671966 
8117407 
8131678 
9798493 
9211324 



NLposltion 
157315-157950 

1 53965-1 54441, 1 56599-1 568 1 9 

121907-122036,122804-122921,124019-124161, 

173889-174062 

20605-20731 

54696-54777 

16945-17053,20018-20403 

77162-77350,81338-81511 

76723-77027,79317-79484 

41111-41281,4549545716,47801-47910 

64835-64994 

4913-5093,7310-7469,9472-9621,9951-10082 



43081-43229 
70284-70518 
29854-29976 
44370-45410 
89972-90319 
127317-127454 
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TABLE 5A: Genes upregulaled in bladder cancer 

Pkey; Unique Eos probeset identffler number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 
Unigene Title: Unigene gene title 

R1 90th percentile of bladder tumor Als divided by the 90th percentile of normal body sample Als 















AI204995 




gb:an03c03j(1 SIralagene schizo brain SI 






AI637993 
























CI 1000517*;gi|7293105|gb|AAF48490.1| (AE 
















AA448542 


Hs.251677 








AI797034 






















Hs.239388 


Human DNA sequence from clone RP1-304B14 




409041 


AB033025 




Hypothetical piotein, XP 051860 (KIAA1 19 






























C19000728*:gi|12585552|sp|Q9Y2Q1|Z257_HU 








Hs.124434 










Hs 41379 








AF143867 


Hs.337588 


ESTs, Moderately simitar to S65657 alpha 




















DKFZP434G032 protein 








































ESTs, Weakly similar to PC4259 ferritin 










NM_025206:Hofno sapiens hypothetical prot 


































































AW411307 








































AW674093 


Hs.334822 


hypothetical protein MGC4485 
















BEOS 1833 




gb:IL2-BT0731-260400-076-F04 BT0731 Homo 






AI738616 










AW004056 
























Homo sapiens cDNA: FLJ22747 Us, clone K 




















Homo sapiens cDNA FIJ14408 lis, clone HE 














433078 


AW015188 


Hs.121575 


Homo sapiens cDNA FLJ12231 lis, ctone MA 


2.40 


425997 


AK000086 


Hs.165948 


hypothetical protein FLJ20079 


2,38 


418322 


AA284166 


Hs.84113 


cyclin-dependent kinase inhibitor 3 (CDK 


2.35 


452012 


AA307703 


Hs.279766 


kinesin family member 4A 


2.34 


445600 


AF0348G3 


Hs.12g53 


PTPRFintsraclIng protein, binding prote 


2.33 


418941 


AA452970 


Hs,239527 


El B-55kDa^socialed protein 5 


2.33 


454609 






gb:MR4-ST0125^21 199^17-d08 ST0125 Homo 


2.28 


444476 


AF020038 


Hs.11223 


isxilralB dehydrogenase 1 (NADP), sdub 


226 


420005 


AW271106 


Hs.133294 


ESTs 


2.22 


439826 


NM_014965 


Hs,6705 




222 


405531 






Target Exon 


2.21 


435569 


BE439539 


Hs.279837 


glutathione S-transferase M2 (muscle) 


2.18 


404394 






ENSP00000241075;TRRAP PROTEIN. 


2.17 


427479 


BE410092 


H5.178471 


KIAA0798gene product 


2,17 


435904 


AF261655 


Hs.8910 


1,2-alpha-mannosldase IC 


2,13 


431620 


AA126109 


HS.2649B1 


2'-5'-ollgoad6nylate synthetase 2 (69-71 


2.12 


426682 


AV660038 


Hs.2056 


UDP glycosyltransferase 1 farriily, polype 


2.10 


422765 


AW409701 


Hs.1578 


baculoviral lAP repeal-conlalning 5 (sur 


2,10 


451385 


AA017656 




gb:ze39h01.r1 Soares retina N2b4HR Homo 


2.09 


403477 






C3002160*:gi|7662420|reflNP_05573ai | Kl 


2.09 


417151 


AA194055 


Hs.293858 


ESTs 


208 


448262 


AW880830 


Hs.186273 


ESTs 


2.07 


415192 


D17793 


Hs.78183 


aMo-kelo reductase family 1, member C3 


2.04 


402994 






NI«L002463*:Homo sapiens myxovirus (intu 


204 



165 
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407581 
410197 
427122 
436481 
436251 
401961 
434042 
447532 
418526 
429612 
422164 



412115 
414809 
432210 



Hs.63287 
Hs.73239 
Hs.77356 



P3ECSL 

3-hydroxy-3-nnethylglutaryl-Coenzyme A sy 
HER2 receptor tyrosine kinase (c-erb-b2, 
HSPC150 protein similar to ublquiSn-con 
nucleolar protein (KKE/D repeal 
NM_021626:Homo sapiens serine carboxypep 
Homo sapiens, Similar to tumor different 
hypolhelical protein FLJ20607 
solute carrier family 16 (monocarboxyllc 
pituitary tumor-transforming 1 
corlic al Itiymocyte receptor (X. laevis 
solute carrier family 1 (high affinity a 



hypothetical protein FU10901 
transferrin receptor (p90,CD71) 
Homo sapiens, done MGB3S44662, mRNA, 



Hs.347524 
Hs.291904 
Hs.25237 



Hs.288649 

Hs,2942 
Hs.99389 
HS.22B80 
Hs.2860 
Hs.82906 
Hs.2340 

Hs!37129 

HsB5962 

Hs.151139 

Hs.77152 

HS.334S86 

Hs.8127 

H3.75725 

Hs.150826 

HS.27B422 

Hs.167013 

Hs.179657 
Hs.83213 



cefaclor required for Sp1 transcripSona 
accessory proteins BAP3i;BAP29 
mesenchymal stem cell protein DSCD76 
ctiemokine (C-C motif) receptor 7 
SREBP CLEAVAGE-ACTIVATING PROTEIN 
interleukIn-1 receptor-associated kinase 
MUF1 protein 

UDP-Gal:betaacNAc beta 1 .4- galactosyl! 
hypothetical protein l«tGC3077 
C11002190*:9i|12737279|reflXP_0121 63.11 
macrophage stimulating 1 receptor (c-mel 



sodium channel, nonvoltage-galed 1, beta 

hyaluronan synthase 3 

E74-like factor 4 (ets domain transcript 



CATX-8 protein 
DKFZP586G1 122 protein 
NhL001436*:Homo sapiens fibrillarin (FBL 
dynam'n 2 

plasminogen activator, urokinase recepto 
fatty acid binding protein 4, adipocyte 
hepaloma-derived growth factor (high-mob 



Hs.333418 
HS.12B152 
Hs.57101 



FXYD 
ESTs 

minichromosome maintenance deficient (S. 
G protein-coupled receptor 56 
singed (Drosophilaj-like (sea urotiin fas 
cadherin 1, type 1 , E-cadherin (epittieli 
Homo sapiens cDNA FLJ1C195 fis, ctone HE 
p63-induoed protein PIGPC1 
v-erb-b2 avian erythroblastic leukemia v 
hypothetical protein FU12442 
claudin 7 

chaperonin containing TCP1 , subunit 2 (b 
FK506-binding protein 4 (59kD) 
fructose-1,6-b 
KRAB-assodale . 
hypothetical protein FU10604 
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451541 BE2793B3 Hs.26557 

406906 Z25424 

429504 X99133 Hs.204238 

414002 NM_006732 Hs.75678 

431369 BE184455 Hs.251754 

TAB LE s g 



plakophilin 3 

gb;H.sapiens protein-serine/lhreonineki 
lipocalin 2 (oncogene 24p3) 



secretory leukocyte protease inhibitor ( 



AW460979 AA136653 AA136666 AW419381 AA984358 AA492073 BE1 68945 AA809054 AW238038 BE01 1212 BE011359 BE01 1367 

BE011368 BE011362 BE01121S BE011365 BE011363 

AW265494 AA4SSg04 AA1 95677 AW265432 AW991605 AA456370 

AA628980AI126603BE50403S 

AA017656AA017374AA019761 

AW810204 AW810555 AW810196 AWBIOBig AW810507 



■Dunham I. el al.' refers to the publication entitled "The DNA 



TABLE 5C 

Pkey: Unique nunnber corresponding to an Eos probesel 
Ref: Sequence soun». The 7 digit numbers In this column are Genbank Idendfier (Gl) numbers. 

sequence of human diromosome 22.' Dunham I. et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
I^Lposltlon: Indicates nudeolMe positions of predicted exons. 



401961 
402239 
402260 
402306 



403477 
404394 
404917 
406631 



NLposltion 

173043-173564 

71448-71574 

15736-16352 

124054-124209 

38175-38304,42133-42266 

113765-113910,115653-115765,116808-116940 

40832-41362 

143456-143626,143608-143935 

4727-4969 

119837-121197 

111834-112008 

37121-37205,37491 •37762,4105341140,41322-4159a4177341919 
4933049498 
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Pkey: Unique Eos probeset identifier numlier 

Btftocn: Exemplar Accession number, Genbanl< accession number 

UnlgenelD: Unigene number 
Urigene Title: Unigene gene title 

R1 90th percentile of bladder tumor Als divided by the 90ti) percentile of normal urothelium biopsy Ais 





90th percentile of bladder 


tumor Als divided by the 90th percentile of normal urothelium biopsy 








UnigenelD 


Unigene Title 








AW014875 


Hs.1 37007 


ESTs 


11.31 








Hs.75294 


corticotropin releasing hormone 








AJ250717 




cathepsin E 








H60720 
















cytokeratin 20 

secreted phosphoprolein 1 (osteopontin, 








AU076643 










AF217513 


Hs.279905 


clone HClO310PRO0310p1 








Y00787 




interleukin 8 








BE267931 


Hs.78996 


proliferating cell nuclear antigen 












topoisomerase (DNA) il alpha (170kD) 








AW957446 














Hs.83169 


matrix melalloproteinase 1 (interstitial 










Hs.166994 


FAT tumor s oppressor (Dtosophila) homolo 










Hs.176663 


Fc fragment of IgG, km affinity illb, r 




498 




AW958544 




normal mucosa of esophagus specific 1 








AW373062 




nuclear receptor subfamily 1 , group 1, m 








AL035737 


Hs.75184 


chitinase 3-like 1 (cartilage glycoprote 








W20010 


Hs.75823 


AUI -fused gene from chnjmosome 1 q 












C 1 002652':gl|5443271splQ04799|FMO5_RABIT 












protein kinase (cAMP-dependsnt, cataiyti 








AA503115 




microseminopTDtein, beta- 








BE6 13348 


Hs.211579 


meianoma celi adhesbn molecule 








BE003054 




matrix mstaliopFoteinase 12 (macrophage 










Hs.1 82362 










NM_001110 




a disintegrin and metalioproteinase dome 












keratin 6A 








AF1 96478 




annexinAlO 








R02740 




pulative chemokine receptor; GTP-binding 








M31126 




matrix metalioproteinase 11 (slromelysin 








BE264974 




thyroid hormone receptor interaolor 13 








N55669 


Hs.333823 










BE267756 


Hs.180312 


mitochondrial ribosomal protein S16 








NM_001982 


Hs.1 99067 


v-erb-b2 avian erythroblastic leukemia v 








AA1 73992 


Hs.7956 


ESTs, IVtoderateiy sinniiaf to ZN91_HUIVIAH Z 


4.90 






AA284166 


Hs.84113 


cyclln-dependent kinase Inhibitor 3 (CDK 
ESTs 








AA205625 


Hs.208067 


4.84 






AF229181 


Hs.1 36644 


CS box-containing WD protein 










Hs.1 00724 


peroxisome proliferative activated recep 








AW960564 




transmembrane 4 superfamlly member 1 








AB020657 


Hs.197298 


f^SI-binding protein 








BE086548 


Hs.42346 


caicineurin-binding protein calsaicln-1 








AW770994 


Hs.30340 


hypothetical protein KIAA1165 








BE093589 




hypothetical protein FLJ23468 








AK001432 




Homo sapiens cDW, FLI10670 fis, clone W 








AB012113 


Hs'iesso^ 




464 




427490 


Z95152 


Hsj78695 




4^63 


5^12 


412490 




Hs.288850 


Homo sapiens cCWA; FU22528 fis, clone H 


4.61 


4.61 


418030 


BE207573 


Hs.83321 




4.B0 




401192 








4.60 


4.29 


426761 


AI015709 


Hs.172089 


Homo sap^ns mRNA; cDNA DKFZp586l2022(f 


4.59 


3.51 


452747 


BE1 53855 


Hs.61460 


Ig superfamily receptor LNIR 


4.58 


5.33 


449618 


AI076459 




KiAA1272 protein 


4.58 


4.58 


423725 




Hs.132127 


hypothetical protein LOC57822 


4.55 


4.55 


415701 


NM_003878 


Hs.78619 


gamma^lutamyl hydrolase (conjugase, fol 


4.52 


4.70 


446742 


AA232119 


Hs.16085 


putative Gfrotein coupled receptor 


4.49 


4.11 


419433 




Hs.7396 


hypothetfcal protein FU23182 




4.48 


412326 


R07566 


Hs.73817 


small inducible cytokine A3 (homologous 


4^47 


4.47 


427528 


AU077143 


Hs.179565 


minlchromosome maintenance deiicieni (S. 


4.45 


4.45 


444371 


BE540274 


Hs.239 


forl(head box Ml 


4.44 


3.87 


444006 


BE395085 


Hs.1 0086 


type 1 transmembrane protein Fn14 


4.43 




424308 




Hs.1 54443 






4^43 


401093 






cSoS^giiesSV^^se^^^^^ 


m 


4.40 


447644 


AW861622 


H5.108646 


Homo sapiens cDNA FU14934 fis, clone PL 


4.39 


4.39 


417933 


X02308 


Hs.82962 


thymidylale synthetase 


4.38 


4.35 


409461 


AA38216g 


Hs.54483 


N-myc (and STAT) Interaclor 


4.36 


3.68 


401451 






l^f^_0044g6*;Homo sapiens hepatocyte nude 


4.35 


4.35 


450746 


D82673 


Hs.278589 


general transcriplkm factor II, i 


4.35 


3.36 
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434203 
417616 
416815 



UDP-N-acetyl-alpha-D-galactosamineipolyp 



NMJ01898 
AA219691 
AW674093 
T51454 



426283 
446849 
400843 
449722 
405506 
420344 
426997 
456525 



411678 
429113 
428428 
442932 
429083 



447973 
404875 

411299 
418827 
446839 
407137 



418478 
414761 
413670 
424840 



BE2B2478 

H03109 

W70171 



429238 
420159 
400289 



oystatin SN 

RAB6 inleracling, kinesin4ike (rabkines 

hypothetical protein MGC4485 

Homo sapiens cDNA: FLJ21930 Ss, done H 

ESTs 

KIAA0175 gene product 
ESTs, Weakly similar to 138022 hypotheti 
GA733-2 aniigen; epithelial glycoprotein 
kynureninase (L-kynurenine hydrolase) 



,1119844 Hs.49476 



Hs.6793 



Hs.23044 
Hs.246315 
Hs.82547 
Hs.1 27826 
Hs.20141 



NM_00310S':Homo sapiens soitilin-relaled 
cyclin B1 
Target Exon 

putative G protein-coupled receptor 
peptidylprolyl isomerase F (cyclophilin 
S100 calcium-binding protein A8 (calgian 
3-hydroxy-3-methylglutaryl-Coenzyme A sy 

collagen, type XVII, alpiia 1 

small inducible cytokine A4 (homologous 

mesoderm specific transcript (mouse) liom 

hypothetical protein FLJ10549 

Eos Control 

uracil-DNA glycosylase 
fatty acid binding protein 6, ileal (gas 
plasminogen activator, urokinase recepto 
laminin, beta 3 (nicein (1 25kD), kaSnin 
S100 calcium-binding protein A8 (calgran 
Homo sapiens winged helixiforkhead trans 
Homo sapiens clone TUA8 Cri-du-chalregi 
squaleneepoxidase 

prostaglandln-endoperoxlde synthase 2 (p 
oyclln-depondent kinase 7 (homolog of Xe 
bromodomain adjaoent to zinc finger doma 
BCL2-related protein A1 
platelet-activating factor acetylhydrola 
RAD51 (S. cerevislae) homolog (Ecdi Re 
UDP-N-acetyl-alpha-D-flalaclosamine:polyp 



RalGEF-like protein 3, 
similar to S.cerevisiaeSSM4 
NM_0228ig*:Homosaplens 



KT021 

mitotic spindle coiled-coil related prot 
gb:ye53h05.s1 Soares fetal Twer spleen 
caspase 4, apoptosis-related cysteine pr 
Homo sapiens prosteln mRNA, complete cds 
KIAA1821 protein 
integrin, alpha 6 

ATPase, Class I, type SB, member 1 
cyclin-dependent kinase inhibitor 2A (me 
enhancer of zeste (Drosophila) homolog 2 
hypothetical protein, expressed in osteo 
extra spindle poles, S. cerevisiae, homo 
ESTs 

annexin A2 

chromosome 20 open reading frame 1 
KIAA0746 protein 
neuron-specific protein 
HT018 protein 

uridine monophosphate kinase 
NIVL025065:Homo sapiens hypothetical prot 
heat shock 70kD protein 4 
glucuronidase, beta 
ATP binding protein associated wTh cell 
cadherin 3, type 1, P-cadherin (placenta 
protein tyrosine phosphatase, receptor t 
Homo sapiens cDNA: FU21245fls, clone C 
matrix metalloprcteinase 10 (stiomelysln 
CDC2Bpniyn kinase 2 
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404977 
422663 
434061 
418113 
431689 
411943 
420005 
453450 
410315 
428839 
437469 
407151 
428157 
450293 



451940 
407722 
422282 



417880 
447957 
418004 
417275 
431211 
401781 
407242 



443162 
441495 
449246 
401780 
417079 
422158 



M431860 
AB012193 
AK00110Q 

AW500087 

AW024973 

AI272141 

AA305688 

BE502436 

AW271106 

AW797627 

AI638871 

AI767756 

AW753112 

H25836 

A1738719 

N36754 

AK001838 
NMJ04415 
AA912347 
NM_004293 



AK000517 

BE281128 

AK000785 

AF1 23050 

NM 004833 

AL359053 

AB018345 

AI735759 

BE252241 

AF019225 

Ml 8728 

BE241595 

NM_014821 

U37519 

X63578 



Hs.181174 
Hs,183874 
Hs.41690 



Hs.82302 
Hs.15514 
Hs.301527 
Hs.1 98427 



ESTs, Weakly similar lo T19201 hypolheli 

cul)n4A 

desmocollin 3 

Insulin-like growth factor 2 (somatomedl 
zinc finger praleln 175 
NPDOOg protein 

SRY (sex determining region Y)-box 4 
UDP-Gal:betaacl4Ac beta 1,3-galactosyltr 
ESTs, Weakly similar to S44608 C02F5.6 p 

ESTs 

ADP-ribosylatlon factor 6 
Homo sapiens cDNA: FLJ22524 fis, clone H 
Homo sapiens cDNA FLJ14814 fis, c:one NT 
irypotiietical protein MGC3260 
— I, Moderately similar lo unknown [H.s 



Hs.1 36585 
Hs.239147 
Hs.21889 
Hs.9238 



Hs.44532 
Hs.105115 
Hs.57664 
Hs.27657 
Hs.52620 



H5.20126 
Hs.87539 
Hs.295449 
Hs.323733 



liypotheti'cal protein FU0Q026 
Target Exon 

senim/glucocortlcoid regulated kinase 

desmoplakin(DPI, DRI) 

ESTs, Weakly similar to JC5314 CDC28/cdc 

guanine deaminase 

Homo sapiens cDNA FU12978 fs, clone NT 
hypolhelicai protein FU23516 
adaplor-reiated protein complex 1, gamma 
hypottietical protein FU20510 
TONDU 

Homo sapiens. Similar to epsin 3, done 



Homo sapiens mRNA full length insert cDN 
KIAA0802 protein 
integrin, beta 8 

pyridoxal (pyridoxine, vitamin B6) Unas 
apolipoprotein L 
Target Exon 

gb:Human nonspecific crossreacting aniig 

seieclin L (lymphocyte adhesion molecule 

KIAA0317geneprodxl 

aUehyde dehydrogenase 3 family, member 



Hs.184276 solute carrier family 9 (sodium/hydrogen 

Hs.234a96 geminin 

Hs.430 olastlnKlisoform) 
Hs.9029 

Hs.127039 ESTs 
Hs.23363 



U 10983 



hypoHieScal protein FL 
NM_005557':Homo sapiens keratin 1 6 (Ibca 
Hs.31134 Interieukin 1 receptor antagonist 
Hs. 1 1 2408 81 00 cateium-binding protein A7 (pson'as 
Hs.56105 ESTs 

Hs.175563 Homosaplens mRNA; cDNA DKFZp564N0763 (f 



412636 13165 1 NM_004415AU)31()58M77830BE149760AW752599AWB48723AW376697AW376817AW376699AW648371 AW376782 AW848789 
AW361413AW849074AW997139AW799304AW799309 BE077020BE077017BE185187AW997196 BE156621 BE179915 BE006561 
BE143155 AW890985 BE002107 AW103521 AA857316 AW383133 BE01137BAW170253 BE1 85750 AW885475 BE160433 J05211 BE082576 
BE082584 BE0O4O47 AW607238 AW377700 AW377699 BE082526 BE082505 BE082507 BE082514 AW178000 AW177933 Ai905935 
AW747877AW748114BE148616AW265328AW847678AW847688AW365161 AW36514BAW365153AW365166AW365175AW365157 
AW365154AW068840BE005272AW365145BE00t925BE182166 BE144243 BE001923Ai951766 Ai43451B BE184920 BE184g33 Ai284090 
BE184941AW804674 BE184924C04715W394B8AW995615BE184948BE159646AW606B53AA099891 AA131128AA337270AA340777 
AW384371 AA852212 R58704 AW366566 AW364859 AA025851 AA025852 AA455100 AA71 995B AW352220 AW996245 B El 65351 BE073467 
AA377127 AWB90264 AW609750 AW391912 AW649690 T87267 AW853812 AA852213 W74149 BE009090 AAOS6401 H9101 1 AW36852g 
AW390272 C18467 AW674920 N57176 AA026480 AW576767 H93284 AA026863 AW177787 AA026654 AW1777fl6 BE092134 BE092137 
BE092136AW1 77784 AI022862BE091653AW376811AW848592AA040018BE185331 BE182164AA368564AW951576T29918AA131077 
W95048 W25458 AW205789 H90899 N29764 W32490 R209(M BE1671B1 BE167165 N84767 H27408 H30146 AI190590 C03378 AI554403 
A1205263 AA128470 Ai392926 AF139065 AW370813 AW370B27 AW798417 AW798780 AW798883 AW798569 R33557 AA149190 C03029 
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TABLE 6C 



AW177783 M088866 AW370829 AA247686 BE002273 AI760816 A1439101 AW879451 AI700963 AA451923 AI340326 AI590975 T48793 
A1568096 A1142882 AA039975 AI470146 AA94B936 BE0S7737 BE067786 W19287 AA644381 AA702424 AI417612 AI306554 AI686869 
A1668892AW190555A1571075AI220573AA056527AI471874AI304772AW517828AI915596A1627383A1270345AW021347AW1 66807 
AW105614Al34607BAA552300 W95ff70Al494069Al911702AA149191AA026864AI83004gAI887258AW780«5AI910434AI819984 
AI858282 AI07B449 AI025932 AI860584 AI635878 AA026047 AA703232 D12062 AW192085 AA658154 AW514597 AW591892 T87181 
AA782066 AW243815 AW150038 AW268383 AW004633 AI927207 AA7821 09 AW473233 AI804485 AW169216 AI572669 AA6021 82 
AW015480AW771865AI270027AA961816AA283207AI076962AI498487AI348053AI783914H44405AW7g9ll8AAl28330AA516500 
AA918281 W02156A1905927AA022701 W38382 R20795 T77861 AW860878 

AW960564 AA092457 T55890 D56120T92525 AI815987 BE182608 B E1 82595 AW080238 M90657 AA347236 AW961686 AW176446 
AA304671 AW583735 T61 71 4 AA316968 AI446615 AA343532 AA0834B9 AA488006 W62095 W39480 N57402 D82638 W25540 W52847 
D82729 D669SOBE619182AA315188AA308636AA112474W76162AA088544 H52265AA301631 H80982AA1137B6 BE620997 AW651691 
AA343799 BE613669 BE54718Q BE546656F11933AA376800AW239185AA376086BE544387 BE619041 AA452515AA001806 AA190873 
AA180433 M159546 F00242AI940609AI940602AI189753T97663T66110AW062896AW062910AW062902A1051622AI82B930AA102452 
AI685095 AI819390 AA557597 AA383220 AI804422 AI633575 AW338147 AW603423 AW606BOO AW750567 AW510672 A1250777 AA083510 
AW6291Q9 AW513200 AA921363 AI677934 A1148698 A195585B AA173825 AA453027 AI027865 AW375542 AA454099 AA73301 4 A15913B4 
R79300 Rfi0fl23 AA843108 AA626058 AA844a9B AW375550 AA889018 AI474275 AW205937 AI062270 AW38S1 1 7 AW3881 1 1 AA699452 
AI242230 N47476 H3B178 AA366621 AA1 13196 AA130023 H39740 T61629 A1885973 AW083671 AA1 79730 AA305757 A1285455 N83956 
AA216013 AA336155 AW999959 T97525 AA345349 T91762 AA771981 A1286092 A1591386 BE392486 BE385852 AA682601 AI682884 
AA345840 T85477 AA292949 AA932079 AA098791 D82607 T48574 AW752038 C06300 

AW373062 T55662AI299190BE17421OAW579001 H01811 W40186R671Q0A1923886AW952164AA628440AW898607 AW898616 
AA7091 26 AW898628 AW898544 AA947932 AW898625 AW898622 A12761 25 Al 1 85720 AW510698 AA987230 T52522 BE467708 AW243400 
AW043642 A1288245 All 86932 D52654 D5501 7 D5271 5 D52477 D53933 D54679 AI298739 AI146984 AI922204 N98343 BE1 7421 3 AA846571 
AI813854 AI21451 8 AI635262 AI13g455 AI707S07 AI6980S5 AW8S452a AI024768 AI004723 AW087420 AIS65133 N94964 AI268939 
AW513280 A1061126 AI435818 AI859106AI360506 AI024767 AA513019 AA757598 X56196 AA902959 AI334784 AIB60794 AA010207 
AW89C091 AW513771 Aig51391 AI337671 752499 AA890205AI640g08H75966AA463487AA358688AI961767AI8662gSAA780994 
AI985913 BE174196 AA029094 AW592159 T55581 N79072 A1611201 AA910812 A1220713 AW1493C6 A1758412AA045713 R79750 N76096 



3 0 Pksy: Unique number corresponding to an Eos probesst 

Ref Sequence source. Tiie 7 digit numbers in tiiis column are Genbank IdenUTier (Gl) numbers. 'Dunham I. et al.' refers to the publicalion entitled The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DI^A strand from wliich exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



400750 8119067 

400843 9188605 

40 401093 B516137 

401192 9719602 Minus 89669-7U101 

401451 6634068 Minus 119926-121272 

4017B0 7249190 Minus 28397-28617,28920-29045.29135-29296,29411-29567,29705-29767,30224^0573 

4017B1 7249190 Minus 83215-83435,83531-83656,8374043901,84237-84393,84955-85037,86290-86814 

45 402230 9966312 



9801324 Plus 

3738341 Minus 43081-43229 

7107731 Minus 142358-142546 

6466489 Plus 80014-80401,80593-81125 
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TABLE 7A: Genes downregulaled in bladder cancer 

Pkey: Unique Eos probeset idsnlMer number 

ExAccn: Exemplar Accession number, Genbanlt accession number 

UnlgenelD: Unigene number 
Unigene Title: Unigene gene title 

R1 9OII1 percentile of normal urolhelium biopsy Als divided by 751I| percentile of bladder tumor Ais 

R2 90tli percentile of normal urothelium biopsy and normal bladder Als divided by the 90tli percentile bladder tumor Ate 









Unigene Title 


R1 










C21000152:gll6226483|sp|Q52118|YMO3„ERWS 4.86 














Z48 




AA577730 


Hs,1 88684 


ESTs, Weakly similar to PC4259 ferritin 








AW298631 




Wolf-Hirschhorn syndrome candidate 1-lik 




266 








Target Exon 












alcohol dehydrogenase IB (class 1), beta 








AA244170 




gb:nc05h02.s1 NCLCGAP_Pr1 Homo sapiens 




a42 






Hs.11 0373 


ESTs, Highly similar to T4262B secreted 




a24 




NIUL003881 


Hs.1 94679 


WNT1 Inducible signaling pathway protein 
















3.69 




AW973454 


Hs.238442 


ESTs, Moderately similar to ALU7_HUMAN A 










Hs!270840 


ESTs 








AW304226 




biphenyl hydrolase-like (serine hydrolas 








AVV83726B 


Hs.279639 


Homo sapiens mRNA; cDNA DKFZp586ivl2022 (f 












ESTs 












neural cell adhesion molecule 1 


1.65 








Hs.248844 


ESTs, Weakly similar to ALU 1 _H UUAN ALU S 












ENSP00000217726*:Laminin alpha-1 chain p 












glyceronephosphate 0-acyltransfeiase 












platelet-derived growth factor receptor, 








AW612481 


Hsjo4105 


ESTs 




2!89 




ALi)79707 


Hsl207443 


hypothetical protein MGC10848 




2.80 




AL046859 




protein kinase (cAMMependent. calalyti 




3.49 




NM 000426 


Hs.323511 


Homo sapiens cDNA: FLJ23176 lis, clone L 




193 




AW976574 


Hs.125103 






5.13 




AF1 79897 




Meis (mouse) hcmolog 2 




4.07 




AL1 18671 








2.86 




A1478250 




ESTs 




1.95 








Homo sapiens mRNA; cDNA DKFZp586B211 (fr 












chondrollin sulfate proteoglycan 4 (mela 












peripheral myelin protein 22 










Hs.306121 


CDC20 (cell division cycle 20, S. cerevl 










Hs 94769 


RAB23, member RAS oncogene family 












ESTs 


1,20 










ESTs 












poly(A)-blndlng protein, nuctear 1 








AK001015 




BCL2-assacia)ed alhanogene 2 




&40 




AI193355 


Hs.160316 




100 








Hs!237731 


ESTs 


roo 




417124 


BE?2OT2 


Hs!25338 


ESTs 


1,00 


473 


443993 


A1620661 


Hs.296276 


ESTs 


1.00 


4.39 


406303 






C16000922al|7499103|pir||T20903hypothe 


1.00 


4.37 




AW89ia02 


Hs.296276 


ESTs 


1.00 


4.37 


422195 


AB007903 


Hs.113082 


KIAA0443 gene product 


1.00 


4.35 






Hs.32478 


ESTs 


1.00 


3.90 




AW207669 


Hs.5630 


Homo sapiens cONA FU13329 fe, clone OV 


1.00 


3.90 


417159 


R01761 




gb:ye31f10.s1 Scares fetal liver spleen 


1.00 


3.82 


445607 




Hs.30156 


ESTs, Weakly similar to unnamed protein 


1.00 


3.62 


406274 






Target Exon 


1.00 


3.59 


410611 


AW954134 


Hs.20924 


KIAA1 628 protein 


1.00 


3.06 


426495 


NM_001151 




solute canrier family 25 (mibchondrial 


1.00 


2.89 


422292 




Hs!l 14360 


transforming growth factor beta-stimuiat 


1.00 




413040 




Hs.12321 


sodium calcium exchanger 


1.00 






NMJ)0SI08 


Hs.211569 


G protein-coupled receptor kinase 5 


1.00 


2.05 


456607 


AI660190 


Hs.1 06070 


cyclin-dependent kinase inhibitor 1 C (p5 


1.00 




429143 


AA333327 


Hs.1 97335 


plasma glutamale carboxypeptidase 


0.97 


2^45 


400288 


X06256 


Hs.149609 


integrin, alpha 5 (fibronectin receptor, 


0.90 


2.47 


442498 


U54617 






0.88 


5.08 


414449 


AA557660 


HsJ6152 




0.88 




412014 


AI620650 




ESTs" Weakly similar to A46010 X-linked 


0.78 


l]88 


425100 




Hs.1 54667 


supervlllln 


0.70 


3.90 


432094 


AI658580 


Hs.81426 


Homo sapiens mesenchymal stem cell prote 




3.41 


427818 


AW511222 


Hs.1 93765 


ESTs 


o!63 


3.75 



TABLE 7B 



Pkey; Unique Eos probeset Wentmer r 
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Pkey CAT Number Accession 

417159 1653899J R01761 R01760 N49787 

419543 185745_1 AA244170 AI018087AA244355 

450515 83710J AW304226 AW008420AA349212H15015 AA317021 A1829484 H25661 H81744 AI906147AA837938 AW167766 AW603578 AW842369 

BE439926 AA902417 AW235409 AA010062 AW069319 A1280242 AW672925 H06848 H05608 R51905 R45023 AW675471 H28475 AI086597 
AI197815 AI825355 Ng9134 AI075956 AI470122 AM49985 AW662833 AA860423 AA913342T23825 A1394207 AI310319 T32467 AI589870 
AI682293 AI810633 BE223045 H14620 AA626645 AA876023 133571 A^_53_982 Am8631 H1i016_AI30«56_AA98_3ra^^^ 

TABLE 7C 

Pkey; Unique number corresponding to an Eos probesel 

Ftef: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gi) numbers. "Dunham 1. et a!.' refers to the publicaUcn en 

sequence of human chromosome 22" Dunham I. et al„ Nature (1 999) 402:489-495. 

Strand: Indicates Dltt strand from which ew 

NLposilion: Inc 

Pkey Ref Strand NLpositkin 
402096 8117697 Minus 24993-25186 
403010 3132346 Plus 78385-79052 
404917 7341851 
406274 7543787 



173 



wo 03/003906 



PCT/LS02/21338 



TABLE 8ft: Genes predictive of bladder cancer progression 



ExAccn: 


Exemplar Ac 


cession nunbei 








UiilgenelD: 


Unigene number 








Unigene Title: Unigenegeni 


e title 










SOIh percentile of Ta or T1 ti 


imor Als fromi patients wiio upstaged divided by 80th pern 


;ntileofTa 


orTI tumo 




median of Te 


lorTI tumor All 


5 from patients wtio upstaged divided by the median of Ta 


orTI tumc 


r Als from 1 


Pkey 




UnlgenelD 








413324 


V00571 


Hs.75294 


ESTs 






437802 


AI475995 


Hs.122910 


7.51 


2^50 


444444 


All 49332 


Hs.14855 


ESTs 


2.58 


1,38 


446033 


AV652402 


Hs.72901 








417771 


AA804698 


Hs.82547 


retinoic acid receptor responder (tazaro 






449618 


Ai07B459 


Hs.15978 


KIAA1 272 protein 






407242 


M18728 








418318 


U47732 


Hs.84072 








423441 


R68649 


Hs.278359 








405033 












423024 


AA593731 


Hs.325823 








425118 




Hs. 154672 








437928 


NM 005476 


Hs.5920 








446684 
436608 


U53445 


Hs.15432 








435602 


AF217515 


Hs.283532 








424098 


AF077374 


Hs.139322 








437553 
428036 
446839 


AW068302 
BE091926 


Hs.130497 


ESTs, Weakly similar to MAT8_HUMAN CHLOR 


2^09 


0:91 


448479 


H96115 












AA317962 


Hs.249721 








401241 


AB028989 












M73720 


Hs.646 








415989 


A1267700 










431070 


AW408164 


Hs.249184 








452140 


AB007928 


Hs.28169 


KIAA0459 protein 


2:44 


195 


443162 


T49951 


Hs,9029 


DKFZP434G032 protein 






435904 


AF261655 


Hs.8910 








407379 


AA332127 


Hs.325804 








442712 


BE465168 


Hs.131011 


ESTs 


Z54 


2J2 


411678 


AI907114 


Hs.71465 








406791 


AI220684 


Hs.347939 








431805 


NI\L014053 


Hs.270594 








438414 
413924 


AA806794 
AL1 19964 


Hs.131511 
Hs.75616 








437679 


Nl^014214 
A1985937 


Hs.5753 


inositol(myo)-1(or 4}-monoptiospha[ase 2 






445911 


Hs.145645 








408349 


BE546947 


Hs.44276 








422545 


X02761 


Hs.287820 








406643 


N77976 


Hs.347939 








407228 


M25079 


Hs. 155376 








449644 


AW960707 


Hs. 148324 








402305 






Homo sapiens HCMOGT-1 mRNA for spam ant 


108 


2^25 


441690 




Hs!33106 










AF143867 


Hs.337588 


ESTs, Moderately similar to 565657 alpha 






403362 


AB007860 


Hs.12802 


development and differentiation eniiancin 


1.12 


2^60 


425234 


AW152225 


Hs.1 65909 


ESTs, Weakiy similar to I3B022 iiypolhell 


1.83 




402366 


AV648601 




apolipoprotein B [including Ag(x) antlge 


1.32 


2!05 


427254 


AL121523 


Hs.g7774 


ESTs 






414533 


AA149060 


Hs.296100 


ESTs 


1.70 


£06 


430157 




Hs.273543 


ESTs 




3,00 


413433 


NM_003199 


Hs.326198 


transcription factor 4 


2^26 


1,41 


410532 


T53088 


Hs.155376 


hemoglobin, beta 


1.52 


1.58 


406779 






NM_005367:Homo sapiens melanoma antigen, 


2,42 


1,22 


450455 


AL1 17424 


Hs.25035 


chloride intracellular channel 4 


2,48 


2,13 


414081 


AW96g976 


Hs.279009 


matrix Gla protein 






414761 


AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homotog 2 


155 


2^08 


415062 


H45100 


Hs.49753 


uveal autoantigen with coiled coll domai 


1.62 


3.75 


406317 
453259 


1^3125 


Hs.124187 


C2002658*;gi|6625694|gb|AAF19354.1|AF185 
ESTs 


1.68 
1,08 


2.25 


445937 


A1452943 


Hs.321231 


UDP-GakbetaQcNAc beta 1 ,4- gdactosylt 


1.76 


1.01 


434370 


API 30988 


Hs.58346 


edodysplasin 1, anhidroUc receptor 


1.12 


2.08 


418058 


AW161552 


Hs.83381 


guanine nucleotide binding protein 11 


2,02 


2.40 
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W37862 




MSTP032 protein 


4.36 


Z18 




AW080356 




ESTs, Weakly similar to ALU7_HUMAN ALU S 


1.51 


3.45 




AW389845 












Ml 30987 














Hs.278628 


KIAA1481 protein 


170 


2.49 




NM_001262 












BEf48799 


Hs,127951 


hypothetical protein FUMSOs'" 


1,76 






AW248508 




ESTs; homologue of PEM-3 [Ciona savignyl 








AW974941 




ESTs, Weakly similar to 178885 serine/th 








NM 002290 








113 
















A1074465 














Hs,241517 


PRO0327 protein 














161 


Z60 




Alj049274 




Homo sapiens mRNA; ci^NA DKFZp564H203 (Ir 








AI393122 












AI500519 




^oUietical protein PR02714 












diemolfine [C-X-C motif), receptor 4 (fus 








AB031480 


Hs.146824 


SPR1 protein 










Hs.191168 






240 




NM_005397 












AK001927 




hypothetical protein FU10462 


268 


2.98 
















AA205248 




gb:zq78c12.r1 Slralsgene hNT neuron (937 








AW264996 


Hs.254299 










AI660842 






1.24 


2.20 




AA351109 
















ESTs, W^aldy similar lo ALIJF HUMAN 111) 








W27060 










45Q401 


AW959281 


























Hs,27263 








440707 


BE256751 


Hs,22867 


Homo sapiens cDNA' FU22073 fis, clone H 




Z10 
















AA4301 55 


Hs.151343 


KIM1524 protein 










Hs, 194485 










AI910275 












AA830335 


Hs,105273 










AF017986 












AW296466 


















118 






















Ct 400 1020:gi| 1 2597441 |gb|AAG60049. 1 |AF31 


160 


3.08 




AW968318 


Hs.285996 


iiypoUielical protein FLJ2337S 








AA203414 






182 


Z05 




BE30007B 




Homo sapiens, clone IMAGE'J535294, mRNA, 


1.34 






NM_001327 


Hs,8722S 


caicerAestis antigen (l«IY-ES0-1) 








A) 168278 












All 26098 






143 


1.22 








mitogen inducible 2 


1.78 


Z28 






Hs. 125232 


Honx) sapiens cDNA FLJ13266 fis, clone OV 




Z60 




AA194055 


Hs.293858 




168 


167 






Hs 33287 










AW070865 


Hs.346390 










AW204343 


Hs.156823 


ESTs, Weakly similar to T30868 RiioA-bind 












replication factor C (activator 1) 3 (38 








AA248060 


Hs.273397 


KIAA0710gene product 








AA099907 




ESTs, Weakly similar to ALU1 HUMAN ALU S 








AW450439 
























AW903838 


Hs.81800 












Hs.238443 








413444 






gb:MR0-HTO067-201 099-002-b10 HT0067 Homo 


168 


2!80 


433844 


AA610175 


Hs.179647 


Homo sapiens cDNA FU 1 2195 fis, clone MA 




2.01 


427055 


AI301740 


Hs.173381 


dihydropyrimidinase-like 2 




2.58 


454244 




Hs.300842 


KIAA1608 protein 


1.00 


2.02 


429503 


AA394183 


Hs.26873 


ESTs 


2.58 


4.08 


422940 


BE077458 




gb;RC1-BT0506-090500-015-b04 BT0606 Homo 


3.48 


2.46 


407949 


W21874 


Hs.247057 


ESTs, Weakly similar to 2109260A B cell 


ai3 


1.48 


437312 


AA809350 




ESTs 


1.10 


2.05 


449426 


T92251 


Hs!l98882 


ESTs 


122 


2.08 


447620 






ESTs 




2.18 


444700 


NM_003646 


Hs!l1729 


fatly-add-Coenzyme A ligase, very long- 


1.72 


2.81 


4362S8 


AW867491 


Hs.107125 


plasmalemma vesicle associated protein 


1.99 




416712 


AW249188 


Hs.1 69577 


Homo sapiens cDNA FIJ14743 lis, clone m 


166 


2!98 


432877 


AW974111 


Hs.292477 


ESTs 


1.63 


2.79 


412085 


AW891667 




gb:CI*t34gT0089-1 10500-179-1109 NTO089 Homo 


1.40 


2.08 
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431'121 


AW969118 


Hs.108144 


ESTs, Weakly similar to unnamed protein 


1.96 


409714 




Hs.199961 


ESTs, Weakly similar to ALU7_HUMAN ALU S 


1.24 


423013 


AW875443 


Hs.22209 


secrsted modular calcium-binding protein 


1.49 


422663 


AW500087 




zincfinger protein 175 


1.01 


439737 


AI761438 




Homo sapiens mRNA full engtli insert cDN 


1.54 


413196 


AA1 27386 




gb:zn90d09.r1 Slralsqeneliing carcinoma 


1.04 




AI660898 


Hs.195602 


ESTs 




443005 


AI0271B4 


Hs.200913 


ESTs 


1.42 


424762 


ALII 9442 


H5.1 83684 


eukar/olic jansla-jon iniliation factor^ 


2.58 










2.08 


413916 


N49B13 


HSJ5615 


apolipoprotein C-ll 


0.70 


418332 


R34976 


HS.7B293 


ESTs 


2.74 






Hs.170328 




1.28 


456583 


AF179897 


Hs.104105 


Meis (mouse) tiomolog 2 


1.42 


447214 


AI367288 


H5.273621 


Homo sapiens cDNA: FU21350 Us, done C 




449254 


W26908 


Hs.172762 


ESTs 


2.04 




W22152 


Hs,2B2929 


ESTs 


2.94 


421021 


AA808018 


Hs,109302 


ESTs 


1.41 




NM_007019 




ubiquilin carrier protein E2-C 


1.61 


432027 


AL096678 


Hs!272353 




1.70 


452688 


AA721140 


Hs.49930 


ESTs, Weakly similar to putative p150 [H 


1.60 


417042 


C75563 


Hs.1 13029 


ritnsomal protein S25 


2.22 


443574 
429372 


U83993 
AA451859 


H5.321709 
Hs.99253 


pun'nergic receptor P2X, ligandflaled io 
ESTs 


1.28 
1.82 


424290 


AA338396 




gb:EST43386 Fetal br* 1 Homo sapiens c 


428518 


AW969656 




gb;EST381733 MNEE resequences, MAGK Homo 


1.72 


455649 


BE065051 




gb;RC1-BT0313-1105X-017-o04 BT0313Homo 


1.65 


414665 


AA160873 


Hs.332053 


serum amyloid A1 




418298 


AA256014 


Hs.86682 


Homo sapiens cDNA: FU21578 lis, ctone C 


1,04 


429655 


U48959 


Hs,211582 


myosin, light polypeptide kinase 






AA618304 


Hs.258785 


ESTs 




452683 


AI089575 


Hs.g071 


progesterone mentrane binding protein 


1.48 


439437 


AI20778a 


Hs,343628 


sialyltransferase 4B (betagaiactosldase 


1.36 


432314 


AA533447 




ESTs 


0.96 


400881 






NM_02508aHomo sapiens iiypothetkal prat 


1,70 


426477 


AA379464 




gb:EST92386 Skin tumor 1 Homo sapiens cD 


2,01 


454741 


BE154396 




gb;CM2-HT0342^91299.05IH)05 HT0342 Homo 


2.12 


423977 


AA333232 




gb:EST37283 Embryo, 8 week 1 Homo sapien 


1.38 


450396 


AUC77002 


Hs,24950 


regulator of G-proteIn signalling 5 


2.13 


431842 


NM_005764 


Hs,271473 


epitiiellal protein up-regulated in canci 


1.90 


415157 


D63267 




gb:HUIul514B08B Ctontedi iiuman placenta po 




418236 


AW994005 


Hs,337534 


ESTs 


1.74 




AB020713 


Hs.56966 


KIAA0906 protein 


1.47 


436143 


AA705245 


Hs.192189 


ESTs 


1.46 




BE515065 


Hs.296585 


nucleolar protein (KKE/D repeal) 


1.43 












mil 


AW801090 


Hs,ig5861 


acti, afphf2!'s'rrao'lh rasde"^aorta 


270 




AI361801 


Hs.167130 


hypothetical protein 


1,66 


453577 


AL043049 




ab;DKFZp434A1523j1 434 (synonym: htes3) 


1.41 


408522 


A1641214 


Hs.46320 


Small proline-nch protein SPRK [human. 




412632 


AL120379 


Hs,74294 


aldehyde dehydrogenase 7 family, member 


1.81 


423291 


NM.004129 


Hs,126590 


guanylate cyclase 1, soluble, beta 2 


1.54 


456172 


R99050 




gb:yq65c02,r1 Soaresfetel liver spleen 


1.46 


452123 


A1267615 


Hs.38022 


ESTs 




433900 


AA721668 


Hs.257761 


ESTs 


1.78 


408436 


R31954 


HS.788S 


phosphatldylinositol binding clathrin as 




417123 


BE326S21 


Hs.159450 


ESTs 


1.46 


436023 


T81B19 


Hs.302251 


ESTs 


2.68 


454150 


AA131893 
AI695764 


Hs.154088 
Hs,202394 


hypothetical protein FU22756 
ESTs 


1.40 
1.28 


429176 


AW975021 


Hs.193800 


ESTs 


1.08 


422259 


AA307584 




gb;EST1 7B498 Colon carcinoma (HCC) cell 


1.80 


451164 


AA015912 


Hs.60659 


ESTs, Weakly similar to T46471 hypotheti 


1.47 




AL041219 


Hs.82222 


sema domain, immunogtobulin domain (ig), 




455642 


BE063965 




gb;QV3-BT0296-140200-085-h01 BT0296 Homo 


1.70 


443387 


BE139135 


Hs.254629 


ESTs 


1.48 


420995 


AA2824g5 


Hs.89014 


ESTs 


1.45 


407329 


AA576061 


Hs.269834 


ESTs, Weakly similar to ALUD HUMAN llll 




438797 


C16161 


Hs.283040 


hypothetical protein PR02543 


0J9 


443357 


AW016773 




low molecular mass ubk)uinone-binding pr 


1.60 


412656 


AF006011 


Hs.74375 


dishevelled 1 (homologous to Drosophila 


1.32 


427377 


AU077029 


Hs.177543 


antigen identified by monoclonal antlbod 


1.24 


412200 


R08110 


Hs.l 87462 


ESTs, Weakly similar to 138022 hypotheti 


1.35 


432586 


AA66B648 




ESTs 


1.60 


411590 


T96183 




gb:ye09f07.s1 Strategene lung (937210) H 


1.22 


422672 


X12784 


Hs.1 19129 


collagen, type IV, alpha 1 


227 


420256 


UB4722 


Hs.76206 


cadhetin 5, type 2, VE<adhei1n (vascula 


1.59 



wo 03/003906 



PCT/LS02/21338 



25 

30 
35 
40 



60 
65 
70 



419900 


AI469960 






30 


2.68 




AW804742 






16 


2,28 




BE077084 




















i!oo 




AW8ia428 










447400 


AK000322 




hypothetical protein FLJ20315 












ESTs, Highly similar to S94541 1 clone 4 








AW607444 
























AF217508 


Hs,95594 










AF004562 




























ESTs, Moderately similar to Y140 HUMAN H 








AA769365 












H79979 


Hs.88678 


ESTs 












gamma-aminobutyric acid (GABA) A recepto 1 


72 






AWB^89 




ESTs 1 


45 


284 








ESTs 1 


24 






AI989490 




ESTs 1 


12 










ESTs 1 


32 


2.08 


424479 


AF064238 




smoothelln 1 


59 


1.10 




AA768966 




ESTs, Weakly sin^llar to 2109260A B cell 1 


28 






AW972724 




gb:EST384816 MAGE resequeiKes.MAGL Homo 1 


52 


Z63 






Hs.240443 


Homo sapiens cDNA:FU23538 lis, clone L 1 


69 








Hs,76118 


ubiquilin carboxyi-terminal esterase LI 1 


71 


119 




AL1 10175 




Homo sapiens mRNA; cDNA DKFZp564H0616 (f 1 


48 






AW294163 


Hs,146127 


ESTs 1 


07 






AF141305 


Hs,173736 


ancient uDiquitous protein 1 1 










Hs.240728 


ESTs, Moderately similar to PC4259 ferri 0,95 










Target Exon 1 


48 






AA479033 


Hs,130315 


ESTs, Wealdy similar to A475B2B-oellgr 1 


38 










ESTs 1 


38 


3.00 




AW610282 




ESTs 1 


43 






AI852926 




ESTs 1 


18 


2,43 




AW960454 


Hs.222830 


ESTs 1 


56 








Hs.127039 


ESTs 2 


80 






BE2S0127 




CDC20 (cell division cycle 20, S.cerevi 1 


36 










ESTs 1 


30 




440495 






hypotheUcal protein DKFZp434l1930 1 












ESTs 1 


12 










ESTs 1 


34 






AA852604 




Thy-1 cell surface antigen 1 


24 










ESTs 1 


30 




456968 


A1174861 


Hs.190623 


ESTs 1 








NIUL006846 
AW855393 




serine protease Inhibitor, Kazaltype, 5 1 


21 








gb;CM3-CT0275-191099^24-f10CT0275Homo 1 


85 


1.94 








ESTs 1 


66 






AW373472 




gb;RC3*T0523-181299.011J12BT0523Homo 1 


56 


Z30 








enolase 2, (gamma, neuronal) 


56 






AW973566 




hypothetical proteinC321[32.4 1 




124 








ESTs,Uoderatelysimilarto2109260ABc 1 


02 






AU076517 


Hs184276 


solute canler family 9 (sodium/hydrogen 


70 


Z39 








gb:HSC1IG021 normalized infant brain oDN 1 


34 










C5001926:gl|7511572|plr||T42245 probable 1 


28 


2.40 






Hs.66718 


RAD54 (S.cerereiaej-like 1 


52 










hypothetical protein FLJ12879 1 


12 


220 




AW369620 




ESTs, Weakly similar to ALU1 HUMAN ALU S 1 




228 






Hs.169370 


FYN oncogene related to SRC, FGR, YES 1 


88 








Hs.80395 


mal, T-celi differentiation protein 1 


65 










ESTs 1 


24 










ESTs 1 


05 










Inlersedin 1 (SH3 domain protein) 2 


00 




452222 


AW805287 


Hs,21432 


SEX gene 1 


25 


^'.^a 


433010 


AW970018 




gb:EST382097 MAGE resequences, MAGK Homo 1 


36 


241 




AA641092 


Hs,257339 


ESTs, Weakly similar to 138022 hypotheti 1 


14 


203 


438855 


AW946276 


Hs,6441 


Homo sapiens mRNA; cDI« DKFZp586J021 (li- 2.24 


1.77 


448718 


AA220235 


Hs,153959 


hypotheScai protein MGC15436 1 


52 


265 


402585 






Target Exon 2.04 


2,46 


424528 




Hs,238954 


ESTs, Weakly similar to KIAA1204 protein 1 


66 


2,05 


422068 


AI8075?9 


Hs.104520 


Homo sapiens cDNA FU13694 Us, clone PL 1 


89 


4,98 


451225 


AI433694 


Hs.293608 


ESTs 1 


79 


2.70 


441078 


AI453268 


Hs,323409 


HomosapienscDNAFUU113fis,cloneMA 1 


44 


258 


409406 


H83092 


Hs,496Q5 


ESTs 1 


38 


2.05 


422297 


AW961290 




p30DBC protein 1 


20 


2.73 


40B711 


AW376061 


Hs.63335 


ESTs, Moderately simllarto A46010 X-lin 1 


20 


208 


426696 


AW363332 


Hs.171844 


Homo sapiens cDNA:FU222g6fls,ck>neH 1 


35 


268 


417324 


AW265494 




ESTs 1 


68 


1.25 



wo U3/003906 



PCT/US02/21338 



408283 


BE141579 




gb;C(i/2-HT0083-071299-018-b05 HT0033 Homo 
carbdiypeplldase Z 


1,25 


265 


415166 


NM-003652 


Hs.78068 


1.34 


1.09 


406300 






Target Exon 


1.61 


2.47 


411BB0 


AW872477 




gb:hm30fO3,x1 Na_CGAP_Thy4 Homo sap'ens 


3.60 


4.03 




F16365 


Hs.114346 


cytochrome c oxidase subunit Vila polype 


2.16 


1.44 


422567 


AF111178 


Hs.1 18407 


glypicane 


1.57 


2.03 


436855 


M732624 




ESTs 


1.08 


2.76 


403535 






Target Exon 


0.93 


2.13 


447733 


AF1 57482 


Hs.19400 


MAD2 (mitotic arrest deficient, yeast, ti 


1.18 


1.07 


417117 


N46778 




gb:yy52bOZr1 Soaresjnultiplejclerosis_ 


1.70 


2.85 




M669253 


Hs.135075 


RNA,U2 small nuclear 


2.12 


2.78 


443243 
423074 


AI452496 
AL1 09963 


Hs.1 32055 


ESTs 

FSH primaiy response (LRPR1, rat) liomolo 


1.15 


2.83 
1.43 


408916 


AW295232 


Hs.429 


ATP synthase, Htransiwrting, milodiondr 


1,63 


2.23 


449799 


A1143456 


Hs.125060 


ESTs 


1.40 


2.08 


415378 






gb:MIB2079-5R Normalized infant brain, B 


1.88 


1.85 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


1.57 


2.57 


434959 


AW974949 


Hs.186564 


ESTs, Weakly similar to 138022 hypotheU 


1.30 


2.30 


416311 


D80529 




gb:HUM081H05B Human fetal brain (TFujiwa 


1.58 


4.35 


444614 


R44284 


Hs.2730 


heterogeneous nuclear ribonucleoprotein 


1.88 


2.98 


456206 


NM_006895 


Hs.81182 


histamine N-methyltransferase 


1.24 


2.08 


410583 


AW770280 


Hs.36258 


ESTs, Moderately similar to JC5238 galac 




4.33 


430410 


AF099144 


Hs.334455 






1.58 


408139 


AA451966 




RAB9-like protein 


1.42 


Z14 


432621 


AI29B501 


HS.12B07 


ESTs, Weakly similar to T46428 tiypotheti 


2,08 


1.94 


441584 


AW1 48329 


Hs.176208 


ESTs 


1.12 


2.05 


445940 


D60438 


Hs.34779 


ESTs 


1.86 


Z70 


453022 


AA031499 


HS.1184B9 


ESTs 


2,02 


1.75 


444008 


BE544855 


Hs.236572 


ESTs, Wfeakly similar to SFR4_HUMAN SPLIC 


1,64 


1.29 


442994 


A1026718 


Hs.16954 


ESTs 


3,60 


178 


402085 






C18000604'.'gil26274361gb|AA88668ai| (Al= 
gb;PI*H'T0019-141299^)09-F08 PT0019 Homo 


1.36 


253 


411918 


AW876354 




ZOO 


2.63 


455508 


AW976165 




gb:EST388274 lulAGE resequences, MAGN Homo 


1.70 


ao4 


426106 


A1678765 


Hs.21812 


ESTs 


1.49 


211 


425131 


BE252230 


Hs.99163 


ESTs 


2.04 


265 


440325 


m 003812 


Hs.7164 




1,17 


255 


420447 


AA687306 


Hs,88448 


ESTs 


1.66 


258 


428055 


AA420564 


Hs,101760 


ESTs 


1.08 


215 


422110 


AI376736 


Hs,111779 


secreted protein, acidic, cystelne-rfch 


1.76 




438581 


AW977766 


Hs,292133 


ESTs, Moderately similar to 178885 serin 


1.08 


210 


403290 






C10001011*ail4758212lreflNP_004411.1ld 


0.97 


248 


408175 


W29089 


Hs,19066 


hypothetic^ protein DKFZp66702416 


1.42 


205 


432390 


AA936177 


Hs,274460 


olfactory receptor, family 5, subfamily 


1.26 


443441 


AW291196 


H5,92195 


ESTs 


1.62 


213 


419925 


AA1 59850 


Hs,93765 


lipoma HMGIC fusion partner 


1.72 


2.80 


445256 


AI858635 


Hs.l 44763 


ESTs 




3.33 


456381 


AA236606 




gb;2r99b10,r1 NCLCGAP_GCB1 Homo sapiens 


1.16 


1.95 


422433 


AA310560 


Hs,153746 


hypothetEd protein FU22490 


1.06 


220 


432529 


Aig89507 


Hs,162245 


ESTs 


1.36 


225 


424951 


AW964082 




gb:EST37S155 MAGE resequences, MAGH Homo 


2,22 


258 


420785 


H89633 


Hs,191346 


ESTs 


1.26 


215 


411347 


AW838126 




gb;CJV2-LT0051-24030fr097-f01 LT0051 Homo 


1,38 


238 


438742 


AW204126 


Hs.196543 


ESTs 


1,10 


230 


414900 
443284 
42049 


AW452420 


H5.248678 


ESTs 


2,01 


3.08 


AI369813 


Hs.64783 


ESTs, Wealdy sim9ar to T42705 hypotheU 
Target Exon 


0,66 
Z28 


0.43 
200 


429400 


AW604940 


Hs.201668 


transcrtptlon factor 20 (AR1) 


1.16 


200 


423916 


AW993496 


Hs.17235 


Homo sapiens clone TCCCIA00176 irsm sequ 


1,59 


1.05 


432495 


AW973537 


Hs.186734 


ESTs, Weakly similar to 161746 pheromone 


1.50 


205 


414840 


R27319 


H5.23823 


haiiy/eniiancer^of-splt related with YRP 


1.89 


209 


428711 


R4B414 


Hs.56828 


trinucleotide repeat containing 5 


1.77 


1.83 


448609 


AW1 39420 


Hs.7972 


K1M0871 protein 


1.14 


226 


443859 


NM_013409 


H5.9914 




1.17 


1.05 


411141 


AW819561 




gb:RCS-ST0293-140200-013-G04 ST0293 Homo 








AI798851 


Hs.266959 


hemogtobln, gamma G 


l!l8 


208 


417944 


AU077195 


Hs,82985 


collagen, type V, alpha 2 


2.10 




429640 


U83508 


Hs,2463 




1.92 


298 


410064 


X53416 


Hs.195464 


filamin ^ alpha (actn-blnding protein- 


1.51 


1.29 


468218 


A1435179 


Hs,126820 


ESTs 




1,15 


443114 


A1033377 


Hs,153298 


ESTs 


1.38 


205 


427788 


AMI 2397 


Hs.1 16858 


ESTs 


1.45 


1.85 


435913 


W95006 




ESTs, Weakly similar to S65657 alpha-IC- 


1.63 




467949 


W69171 


HsJ1741 


ESTs, Highly similar to 138946 melanoma 


1.01 


200 


419203 


AA.1B8719 


Hs.190151 


ESTs 


1.94 


245 


412610 


AI05668g 


Hs,133538 


ESTs, Weakly similar to ALU1JHUMAN ALU S 


1.91 


220 


413885 
426239 


BE177442 
AA569615 


Hs.214226 


gb;RC1-HT0595-200400-012-f01 HT0595Homo 
ESTs 


1,48 
1.36 


280 
250 



wo 03/003906 



PCT/LS02/21338 



A1703484 
BE167242 
BE008018 
AW772713 



417777 
401400 
426798 
435046 
448401 



441214 
431723 
414907 



444014 
431603 
408697 
444312 
404286 
438813 
445534 
426046 
451907 
416796 
422431 
417657 
455313 



BE154847 
BE262773 
AI095718 
AA807955 
AW419069 
R44007 

M27346 
AL038823 
AA833665 
AI822066 
AA228351 

AA225622 
AW894409 
F10042 



Hs.255036 
Hs.1 28052 
Hs.47099 

Hs.247186 
Hs.151839 

Hs.7055 

Hs.1 72405 
Hs.174330 
Hs.346254 
Hs.105421 
Hs.126483 
Hs.1 27307 
Hs.200854 
Hs.129136 
Hs.16762 
Hs.77597 



ESTs 

Target Exon 

ESTs, Weakly similar lo I7B885 seM 

Tangel Exon 

cell division cycle 27 

ESTs, Weakly similar to ALUIJHUMAN ALU S 

ESTs 
ESTs 



,;cDNADKFZp564B2062(f 
polo |Dr(sophia)-llke kinase 
gb;PIVI1-HT0345-121199-001-d05 Hn)34S Homo 
hypothetical protein FU10767 



Hs.1 2840 
Hs.206868 

Hs.50749 



Hs.209670 ESTs 
ESTs 



C600ig09:gl|704441|dbj|BAA18909.1| (D298 
gb;Homo sapiens (clone HGP09WGP32) Tee 
Homo sapiens germline mRNA sequence 
Homo sapiens cDNA FU14056 lis, cbne HE 
ESTs, Moderately similar to ALU7_HUMAN A 
ESTs 
ESTs 



Hs.1 25472 ESTs, Moderately similar to KIAA0877 pro 



144538 Hs.231863 ESTs 



HS.U5903 

Hs.3569 

Hs.301183 

HS.1BB750 

Hs.87134 

Hs.313636 
Hs.144153 
Hs.1 58549 



427681 AB018263 Hs.1 80338 

436164 AA764950 Hs.1 19898 

451233 AA047221 Hs.59752 

446856 AI814373 Hs.164175 

448211 BE384592 Hs.6451 

418283 S79895 Hs.83942 



ESTs 

Homo sapiens clone 26061 mRNA sequence 
Human DNA sequence from clone RP5-881L22 
myeloproiiferalive leukemia virus oncoge 
gb:Homo sapiens mRNA fragment 
Homo sapiens cDNA FLJ12257 fis, clone MA 
ESTs 



gb:PMO-HT0339-081199-001-h05 mm Homo 



ESTs, Weakly similar to T2D3.HUMAN TRANS 
Homo sapiens mRNA; cDNA DKFZp564C0371 (f 
tu mor necrosis factor receptor superbmi 
C8001690*:gl|67544461ref|NP_034760.11 W 



AW954938 
AWg76145 
AA846824 



441354 
428951 
438272 
429642 
422121 
411184 
435871 
430570 



AL138153 

X68264 

AI767949 

AW821117 

AF257077 

A1417B81 

AL080197 



Hs.172816 

Hs.1 80908 

Hs!31314 
Hs.1 26813 
Hs.300410 



HS.2B3627 
Hs.292484 
Hs.272302 



of growth family, member 3 



relinoblastania-blnding protein 7 
ESTs 

ESTs, Moderately similar to A47582 B^l 
ESTs, We^ly similar to S65824 reverse t 
melanoma cell adhesion molecule 



wo 03/003906 



PCT/LS02/21338 



10 
15 
20 



45 

50 
55 





AW296946 


Hs. 256078 


ESTs 








BE149154 




gb:RC2-HT0252-271 099-01 7-c11 HT0252Homo 












gb;HSC1QD011 normalized infant brain cDN 








AI555566 


Hs. 168587 


























gb:RC1>CT0252-030100-023-g09 CT0252 Homo 
































AW298047 


Hs.346198 










AA1 77088 


Hs.190065 


























NM 022137":iHomo sapiens secreted modular 






















































AVV752656 


Hs.222707 


i<iAA1718 protein 












gb:x39d05>r1 NCLCGAP Pr2 Homo sapiens 










Hs,143739 






































ENSPOC000243337:CDI^ FLJ13984 f s, done 






403903 






C5001 632*;gi| 1 0645308|gb|AAG21430.1 iACCO 




















BE501689 














Hs.25450 


































A120851 1 




Homo sapiens cDNA iTJ1 1720 lis, clone HE 












gb:QV4-TT0008-18119&4)38-li04 TT0008 Homo 












ESTs. Weaifly similar to 178885 serine/tii 








AF086303 








































gb"CM3-CT0275-191099-024-e06 CT0275 Homo 










































424408 


A175481 3 


Hs.1 46428 










AK000742 


Hs.1 26774 






















AI686579 








































Human clone 137308 mRI^, partial cds 








AI864193 


Hs.1 69728 


iiypoUietical protein I-IJ13150 












iiomolog of Yeast RRP4 (ribosomal RNA pro 












iiypotiietlcat protein FLJ13459 
















































gb CMO-U[i'IOD01-010300-258-h1 1 UiwIOOOl Homo 




































EN SP000002491 58':CDI'1A 


































































423264 


















gb'nc15d10.s1 NCI CGAP PT^ Homosaplens 






410370 


AB037753 


Hs.62767 


KIAA1 332 protein 


134 


2.00 


417264 


AA195100 


Hs!l88695 


ESTs 


1.09 


2.51 








ESTs 


1.56 


2.92 


419386 


AA236867 




ESTs, Weakly similar to 138022 hypotheli 




2.28 


439212 


AF087995 


Hs.134877 


ESTs 


1.06 


2.90 


437765 


W69171 




ESTs, Higiily similar to 138945 melanoma 


1.25 


2.28 


448951 


A1611221 


HS334802 


iiypotiietical protein FU10101 






401659 






Target Exon 


l!63 


m 


419145 


N99638 




gb2a39g11.r1 Scares fetal liver spleen 


2.63 


3.85 


444813 


AW064834 


Hs.210356 


ESTs 


1.73 


2.14 


433902 


AW292820 


Hs.144906 


ESTs 


1.88 


2.46 


403072 






NI»l.003319*:Homo sapiens I'Un (TTN). mRN 


1.32 


2.91 


452484 
45678B 


AB033042 


Hs.29679 


coMor required for Spl transcrlptiona 


0.72 


0.70 


AA724612 


Hs.133130 


Homo sapiens mRNA; cDNA DKFZp566H0124 (f 


1.90 


3.40 


403315 








1.22 





wo 03/003906 



PCT/LS02/21338 



406432 


AJ289116 




CD1E antigen, e polypeptide 


2.31 


2.63 


457785 


AA682670 


Hs.160884 


ESTs 


0.96 
1.46 


2.38 
1.07 


433259 
436882 


AA580665 
AW016722 


HS.326C82 
Hs.194976 


ESTs 

SH2 domain-containing plnsphalase anchor 


1.38 


2.13 


401473 






Target Exon 




2.04 


444816 


248633 


Hs,283742 


H.saplens mRNAforrelrotransposon 


1.64 


2.15 


438029 




HS.1C235 


chromosome 5 open reading frame 4 




2.25 


431830 




Hs.271387 


small Inducible cytokine subfamily A (Cy 




2.21 


450817 


N71597 


Hs,29698 


ESTs, Weakly sWlar to ZN91_HUMAN ZINC 


2.20 


2.90 


404427 






C8000068*^|ll5453579|reflNP_006120.1| bo 


0.74 


0.81 




AW970093 


Hs,24453 




1.45 


2.55 


405723 






Target Exon 


1.60 


2.28 


436896 


AW977385 


Hs.278615 


ESTs 


1.17 


1.54 


411974 




Hs,84264 


acidic protein rich In leucines 


1.54 


2.08 




AI123478 


Hs,32112 


ESTs 


1.72 








Hs,255418 


ESTs 


1.25 


1.31 


424991 


AA775471 


Hs.241467 


ESTs 


0.62 


0.37 


443100 


AI033188 




gb;ow94e08.s1 Soares Jetaljlver_spleen_ 


1.15 


2.34 


445332 


AI220225 


Hs.321057 


ESTs 


1.07 


2.00 


414781 


D50917 


Hs,77293 


KIAA0127 gene product 


1.04 


3.43 


421893 


NM 001078 


Hs,109225 


vascular cell adiieslon molecule 1 


1.15 




424265 


AF173901 


Hs.144287 


hairy/enhancer-of-splil related with Yi^ 


1.26 


2.00 




BE161878 


Hs,224805 


ESTs 








AA65258B 


Hs.301348 


Homo sapiens cDNA FU13271 Us, done OV 


Z26 


1.29 


438190 


AA780020 


Hs.136798 


ESTs, Moderately similar to KBF3_HUMAN N 


1.44 


2.07 


406972 


U32053 




gb',Human H19 IWt gene, complete cds. 


1.38 


2.11 


417086 


AA194446 




ESTs, Weakly similar to SSS024 neliuiin, 


Z12 


3.30 


446410 


A1361109 


Hs.151721 


ESTs, Weakly similar to 138022 hypotheti 


1.14 


2.33 


427674 


NM_003628 


Hs.2178 


H2Bhlslone family, member Q 


1.24 




422526 


AA311763 


Hs.131056 


ESTs 


1.29 


2.04 




AF086127 


Hs.50600 


ESTs, Weakly similar to T47156 hypotheti 


1.26 


2.12 


409126 


AA063426 




gb:2f70c08.s1Soares4!lnealjland_N3HPG 


1.28 


2.20' 


412093 


BE242691 


Hs,14947 


ESTs 


0.87 


2.58 


420169 


AA256126 


Hs.16179 


hypotiietical protein FLJ23467 


1.38 


207 


426096 


D87436 






2.00 


2.25 


402551 






NM J05012*:Homo sapiens receptor tyrosin 


0.80 


0.82 


405760 






Target Exon 




2.65 


402901 






NM.025206*:Homo sapiens hypothetfcal pro 






453982 


AW014252 


Hs.252837 


ESTs 


1.44 


2.03 


424244 




Hs.143601 


hypothetical protein hOAiso 


1.40 


1.18 




BE559514 


Hs.275426 


liypothetical protein 


1.30 


2.15 


457297 


AW968188 




gb:EST380383 MAGE resequences, MAGJ Homo 


1.64 


3.17 




AI733907 




gb:2o86h09.yS Stratagene ovadan cancer 


1.00 


2.20 


426273 


AI174861 


Hs. 190623 


ESTs 


1.19 


1.15 


405187 






NM 01 4272:Homo sapiens a dislntegdn-lik 




1.35 




AL047051 


Hs,199961 


ESTs, Weakly similar toALU7_HUMAN ALU S 


2.44 




427596 


AA449506 


Hs.270143 


extracellular glycoprotein EMILIN-2 prec 


2.06 


1.50 


408049 


AW076098 


Hs.345588 


desmopiakia(DPI,DPil) 


2.01 


3.90 


406002 






Target Exon 


1.73 


2.08 


408284 


AW248254 




protein kinase PKNbela 


1.28 




431377 


AW178807 


Hs,246182 


ESTs 




2.70 


451456 


AW386183 


Hs,210305 


ESTs 


1.44 


2.08 


427530 


AA406093 


Hs.126519 


ESTs 


1.07 


1.12 


431957 


AK002104 


Hs.272246 


hypolheUcal protein FU11292 


1.27 


2.89 


422283 


AW411307 


Hs.1 14311 


CDC45 (cell division cyde 45, S.cerevis 


1.74 


1,28 


419600 


AA448958 


Hs.91481 


NEU1 protein 


1.13 


1.15 


423314 


A1400661 


Hs,1 27811 




1.44 


2.53 


451690 


AW451469 


Hs,209990 


ESTs 


1.41 


2.49 


454662 


AW812715 




gb:RC4-ST0185-271099-01 1-g01 ST0185 Homo 


1,35 


2.88 


454413 


A1653672 


Hs.40092 


PNAS-123 


1.79 


2,03 


416861 


AW977206 


Hs.151858 


ESTs 


1,52 


2,10 


415908 




Hs.22833 


ESTs 


1.37 


2.13 


438942 


AW875398 


Hs.6451 


PRO0659 protein 


1.80 


1.55 




AW054922 


Hs,53478 


Homo sapiens cDNA FU12366 is. clone MA 






429177 


AA447527 


Hs,207429 


ESTs 




3^19 


448357 


N20169 


Hs.108923 


RAB38, member RAS oncogene family 


0J7 


0.73 


422008 


AJ000534 


Hs.l 10708 


sarcoglycan, epsilon 




4.08 


434461 






ESTs, Weakly similar to 178885 serineAh 


166 


2.16 


413489 


BE144228 




gb:MR0-HT0185-1402004)09-d04 HT0165 Homo 


1.28 


2.23 


405551 






Target Exon 


1.11 


1.19 


441183 


BE562910 


Hs.92679 


Homo sapiens done CDABP0014 mRNA sequen 




1.20 


456034 


AW450979 




gb:UI-H-Bi3-ala-a-12-0-Ul.s1NCl_CGAP_Su 


1.98 


2.53 


420611 


AA994635 


Hs.129929 


ESTs 


1.46 


2.15 


422061 


BE178434 


Hs.267995 


ESTs, Moderately similar to G02654 ribos 




2.30 


437908 


AI082424 




ESTs 


138 


22\ 


423052 


M2B214 


Hs.1 23072 


RAB3B, member RAS oncogene family 


0.74 


043 


401927 






C17000914«;gll8394367lref|NP_058549.1|s 


2.26 


2.14 



wo 03/003906 



PCT/LS02/21338 



35 
40 
45 





M572949 


Hs,207555 




1.52 


228 


439159 


AF087972 




ESTs 


2,03 


Z08 


415357 








1.82 


Z07 




AA991745 


Hs,42522 




1.48 






AB020696 


Hs,234791 


KIAA0889 protein 


1.46 


2.23 




Al 809984 


Hs.243209 


ESTs, Weakly similar to NPA1_HUMAN NEURO 


1,06 


2.13 




AW068823 


Hs.1 19206 


Instilin-like growth factor lifnding prote 




2^84 




AI076345 








429040 


AL035542 


Hs,248159 


olfactory receptor, family 2, sulifamily 


r25 




414169 


AA136169 


Hs,1 49335 


ESTs 




Iso 




AA687313 


Hs, 190043 




120 




426900 


AW1 53564 




ESTs 


187 


177 






Hs.325474 


Target CAT 




2.02 




AL360140 




Homo sapiens mRNA full lengtli insert cDN 




2.10 




AW0B5131 


Hs.190452 


KIAA0365 gene product 


1.41 


1.52 


408339 


AW277084 




gb3<p61li09.x1 NCLCGAP_Ov39 Homo sapiens 




2.03 


450492 


AW290961 


bb.2Q1815 




1.17 


2.21 








Homo sapiens done 15 polyadenylated HER 


1,62 






AW939074 




gb:QV1-DT0069-01 0200-057-c12 DT0069 Homo 


1.80 


173 




AI904823 


Hs.31297 




2.15 


1.84 




AA225048 


Hs.104207 






2.33 


439099 


AB037800 




protein Idnase C and casein kinase subst 


107 


1.15 








Homo sapiens mRI^ cDNA DKI7p566F1946 (f 


1,18 
2.24 


1.22 
2.55 




AA236863 


Hs.188894 


ESTs, Weakly similar to APXLHUMAN APiCA 


1.07 


1.14 


402948 










1.83 




AW296631 






128 


107 




AW629604 




hypothetical protein from EUROIMAGE 1703 


0.85 


0.77 




AA334592 


Hs79914 




1.88 


1.27 




AW967280 


Hs.293894 


ESTs, Weakly similar to HERC2 [Rsaplens 


1.20 










ESTs, Moderately similarto ALLI1_HUMAN A 


1.03 


2.31 




BE297851 


Hs!l89482 






2.20 








ESTs, Weakly similar to Y934_HUMAN HYPOT 




2.25 


400850 








122 


1.03 


404580 








1.00 


1.00 




AVV064284 












AA224053 


Hs,172406 


cell division cycle 27 








AA907153 


Hs.190060 




176 


1.45 




AA384910 


Hs.46519 






2.15 








DKFZP434B061 protein 








AI659131 




hypothetteal protein MGC2849 


1.00 








Hs.285681 


WIIIiamS'Bauren syndrome chromosome regi 


1.40 




440304 


BE159984 


Hs. 125395 


ESTs 


100 






AW511126 


Hs.127572 


ESTs 


1.00 


173 




A1792453 


Hs.166507 






1.00 






Hs.201955 




100 


1.00 








H26 hisione family, member L 


1.00 


1.00 




AA279143 


H&88561 


ESTs 


100 


140 




A1472022 


Hs.301959 


proline synthetase co-transcribed (bacta 


0.98 


0.63 






HS.209G94 




1.00 


1.00 




AA831552 


Hs.268016 


Homo sapiens cDI4A: FU21243 Us, clone C 


1.00 


1.00 








H3 histone family, member L 


114 


140 




AW971303 




ESTs 


1.08 


ISO 




AW876263 




gb:PM4-PT0019-131299-006-E09 PTD019 Homo 


0.82 


0.45 




AW970989 




gb;EST383074 MAGE resequences, MAGKHomo 




0.71 






Hs.282244 


ESTs, Weakly similar b pepUdoglycan re 


100 


2.10 




AA336519 


Hs.83623 






1.00 






Hs. 156084 




106 


2.02 




AI015418 


Hs.1 27556 






2.03 


402039 










0,36 








KIAAD776 protein 




0.63 








nuc^^arrece rsu y .group ,m 




100 


409582 


R27430 


Hs.271565 




1,00 


428769 


AW207175 


Hs.106771 


ESTs 


100 




442868 


AI022701 


Hs.336984 


ESTs 


127 


l',62 






Hs.173921 


ESTs, Weakly similar to 210g260A B cdl 


100 


1,33 






Hs.1 72979 


zinc finger protein 177 




2.25 


419015 


T79262 


Hs.14463 


ESTs 




2,03 


415806 


AA169560 




gb:2o89d08.r1 Stratagene ovarian cancer 


100 




436110 


AA704899 


Hs.291651 


ESTs, Weakly similar to 138022 hypotheli 


1.60 


2!21 


458760 


Al 498631 




fen-itin, liglit polypeptide 


1.10 


143 


447342 




Hs! 19322 


Homo sapiens, Similar to RIKEN cDNA 2010 


1.28 


1.00 


438182 


AW342140 


Hs.1 82545 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


0.90 


2.31 


438091 


AW373062 




nuclear receptor subfamily 1, group 1, m 


4.70 


1.00 


441633 


AW958544 


Hs.1 12242 


normal mucosa of esophagus specific 1 


2.48 


1.00 


432222 


AI204995 




gb:an03c03.x1 Stratagene schizo brain SI 


196 


2.84 


416055 


Z45423 


Hs.13349 


Homo sapiens cDNA FU14647 fis, cbne NT 


1.52 


1.46 



wo U3/003906 



PCT/US02/21338 



55 

60 
65 





AA836392 




hypothetical protein FU20495 1 








AV647015 










408969 


AW297929 


Hs.328317 


paired immunoglo ulinlilie receptor ta 












Homo sapiens cDNA: FLJ23220 fis, clone A 












KIAA1Q52 protein 








AW976247 












BRCAlb 




Eos Control 


36 


1^32 












1.18 
















AA325560 












A1878910 


Hs. 278670 


cisplatinresislance^soclatedoverBxpr 








AW451780 












AA399024 












AL036128 


Hs.274404 










AW293459 












AA723297 


Hsll^3B 




18 


116 
















AI939440 


Hs,345192 












Homo seplens cDNA: FIJ21 1 22 fis, clone C 1 




2.48 




BE348404 


Hs]24740 




40 


2.75 




AW753456 




gb:QV2-Cr0261-2B1099-011-<l11 CTOZBI Homo 








AA335715 


Hs,200299 










NM 014921 


Hs.107054 










AI290691 


Hs,131393 










Al 523875 




gb:lo97d04.x1 NCLCGAP_CLL1 Homo sapiens 








AW444548 


HS.16311B 








A1052628 


Hs.271570 


ESTs, Wfeakly similar to 21 09260A B cell 








AA5B0288 




gb-jin12d01.s1 NCI_CGAP_Co12 Homo sapiens 








BE0B6548 
















^cineurin 1 ing protein ca sara 


06 










ESTs 1 


62 






AI559696 


H3,298885 


ESTs 1 


37 










ESTs 1 


62 










gb:TCBAP1E0661 Pediatric pre-B cell acut 1 












ESTs 1 








AW295791 




G prolein-coupled receptor 86 


47 






AF085126 




Homo sapiens cDNA FU1 1727 fls. clone HE 2 


08 






AA808948 




ESTs, Modsratelj' similar to ALU1_HUMAN A 1 








AK001084 


Hs.333498 


Homo sapiens cDNA FU10222 fis, clone HE 2 


17 






AL1 20295 




ESTs, Moderately similar to PC4259 ferri 1 


88 


2.58 




RS943S 


Hs,255889 


ESTs 1 












KIAA1698 protein 1 


15 










ESTs 1 


38 






AA464964 




gb:zx80f10£l Soarss ovaiy tumor NbHOT H 0.92 










ESTs, Vtealdy similar to S41044chromosom 1 




2.51 




BE065069 


Hs.270833 


amplilregulln (schwannoma^lerived giDwUi 


94 










secreted modular calcium-binding protein 1 


72 


268 




A1498267 


Hs!l 10613 


KIAA0421 protein 2.28 






AI248285 


H5.118348 


ESTs 1 


42 


245 




AA584B29 


Hs.275163 


non-metastatic cells 2, protein (NM23B) 2.07 


253 




AA319278 




gb;EST21478 Adrenal gland tumor Homo sap 


46 


1.22 




BE259160 




aldo-keto reductase family 1 , memtier B1 


43 


1,43 




N71B20 


Hs.1 18173 


ESTs 1 




293 


441063 


AA913819 


Hs.188025 


ESTs 1 


20 


280 








gb:yi79b05.rl Scares breast 2NbHBst Hono 1 


44 


4.18 


430446 


AF131782 




Homo sapiens clone 24941 mRNA sequence 1 












NM_000761:Homo sapiens cytochrome P450, 1 


13 


1.22 








C5000010';gi|10440464|dbj|BAB15765.1| (A 1 


22 






AI204995 




gb:ar>03c03.x1 Stratagene sctiizo brair SI 2 


72 










ESTs, t^Dderalely similar to ALUI.H'JMAN A 1 












OKFZP555K1 924 protein 1 












Wolf-Hirschhorn syndrome candidate 1-lik 


33 




448455 


AI252625 


Hs.269860 


ESTs, [Moderately similar to S65657 alpfia 0 


63 


o!40 








ESTs 1 


66 


2.50 


414623 


BE391050 




gb:601285674F1 NIH_MGC.44 Homo sapiens c 1 




a88 


454915 


AW841619 




gb:RC1-CN0017-12C2QO-012-b09 CN0017 Homo 1 


14 


2.10 


444064 


W85970 


Hs,16292 


ESTs 0 


80 


0.63 


454353 


AW389693 


Hs,300700 


hypothetical protein FU20727 1 


30 


2.10 


447794 


AI424999 


Hs.161445 


EST 1 


26 


205 


426686 


AI362802 




parathymosin 1 


16 




435815 


AA700482 


Hs!l13157 


ESTs 1 




273 


432482 


LI 9267 


Hs.276924 


dystiophia myotonlca-containing WD repea 1 


10 


1.15 


431062 


AA491270 


Hs.187946 


ESTs 1 




260 


429191 


AF065215 


Hs.198161 


phospholipaseAa, group IVB(oytosolio) 1 


35 


1.06 


424456 


AA341017 


Hs.25549 


hypothetical protein FU2089B 1 


21 


345 


451124 


AI186203 


Hs.31432 


cardiac ankyrin repeat protein 




210 


432828 


AB042326 


Hs.287402 


chondroitin4-sulfolranslera3e 1 




271 



wo 03/003906 



PCT/US02/21338 



BE246400 Hs.285176 



25 
30 

35 



420232 
418927 
441940 



420067 
442180 
434256 
444619 



AW205708 
AI439905 
T52431 



Hs.255295 

Hs]l28152 

Hs.60748 
Hs.292725 
Hs.344476 
Hs.94795 



acetyl-Coenzyme A transporter 
gb:zx37g07.r1 Soares_lotal_fetus_Nb2HFe_ 
gb:yi87g02.r1 Soares placenta Nb2HP Homo 
gb:Homo sapiens full lenglli Insert cDNA 

ESTs 
ESTs 
ESTs 

C9000193*:gi|6330729|dbi|BAA86547.1| (AB 
chromosome 21 open reading frame 18 
ESTs, Weakly similar lo 118818 hypotheli 
gb:ti57g08.x1 NCI_CGAP_Lym12 Homo sapien 
Homo sapiens mRNA; cDNA DKFZp6640222 (fr 
ESTs 



A1160304 Hs.28313 

AW85573B Hs.17767 
BE177983 

A13S7B23 Hs.146872 

N49729 Hs.156875 

AWig2334 Hs.38218 

BE390SS1 Hs.77628 

BE559SS4 Hs.61790 

AW977288 Hs,17428 

AA744520 Hs.87734 

AI798039 Hs.270563 

AI827946 Hs.124354 

AA833902 Hs.270745 

BE537128 Hs.299797 
AI500724 



423436 
468697 
415770 
449279 

442124 
412557 
409335 



421338 
412679 
417882 
426112 



457329 A1634860 
434830 AW862235 
450696 AI854223 
AW072215 
BE562018 



446098 
443310 
424015 
420229 
403371 
410744 
424160 



432211 BE274530 

448918 AB011152 

424496 AI733451 

410730 AW368850 

457581 AA578512 

435353 AW243062 

417029 AW962192 

433682 AA642418 

424915 R42755 

442201 AW516704 

429111 AI870811 



steroidogenic acute regulatory protein r 
C1400016S:gi|1269806g|dbJ|BAB21B53.1|lA 
hypotlietlcai protein FU23338 
RBPI-lte protein 

ESTs, Wealdy similar to nonsyndromic ilea 
gb;RC1-DTl3029-12010(H)11-IO7 DT0029 Homo 
ESTs, Moderately similar to T12512 iiypol 
hypotlietlcai brain protein iny040 

ESTs 



m5034 gb:ym 

R21176 Hs.1 00926 ESTs 

A1797713 Hs.1 56471 ESTs 
M79237 

AI962312 Hs.224976 

AA458759 Hs.188794 

R66412 Hs.129013 

AA761612 Hs.291557 

NM_001502 Hs.53985 

AF181862 Hs.242407 
AA281092 
AA033904 

W44877 Hs.55501 

A1S71294 Hs.298889 
AA287443 
BE144762 



> sapiens cDNA FU14309 f s, cbne PL 



Hs,269236 ESTs 



2 (zymogen gn 
ipied receptor, family C,gr 
Homo sapiens cDNA: FU22806 Us, clone K 



AWgS3291 Hs.64211 

AW377549 Hs.1 7865 
A1553976 Hs.121191 
AA360439 Hs.B9319 
Hs.247043 



ESTs, Moderately similar to ALU1.HUMAN A 
gb:zs52clar1 NCI.CGAP.GCBI Homo sapiens 
gb:CMO-HT0180-041099-06S-b04 HT0180 Homo 
gb:yh26cag.r1 Soares placenta Nb2HP Homo 
hypotlietlcai protein MGCSeoi 
C160002iagl|12585542|sp|O14771|Z213.HUM 
ESTs 
ESTs 



Hs.208470 ESTs 
Hs.133152 ESTs 



Hs.273385 
Hs.17381 
Hs.23096 



Homo sapiens mRNA; cDNA DKFZp564F112 (fr 
ESTs, Weakly similar lo AF252293 1 PARS 
Target Exon 

gb:ys92b01.r1 Soares retina N2b5HR Homo 

gb:yc81f01.r1 Soares Infant brain 1N1B H 

ESTs 

ESTs 

ESTs 

hypothetical protein FU10986 

K1AA0580 protein 

hypottietcal protein FU12975 

DnaJ (Hsp40) homolog, subfamily B, membe 

gb:nh22e11.s1 NCLCGAP.PrI Homo sapiens 

ESTs 

guanine nucleotide binding protein (G pr 



wo U3/003906 PCT/US02/21338 



429282 










3.73 


436604 


AW105129 












W0 1046 


Hs.333371 


Homo sapiens clone TA4D unt-arslated mRN 


079 


2.70 


412274 


Ml 01443 




gb:zn74a07.r1 Slralagene NT2 neuronal pr 
C50O14O8*:gi|12621134|ref|NP_075244,1| M 




2.53 
















A1692649 




hypothetical protein MGC13168 




2.60 


400840 






ESTs, Weakiy similar to B28096 line-1 pr 




2.25 








gb:RC2-ST01 68-091099-01 1-d05 ST0158 Homo 








AI72QQ78 


Hs.291997 


ESTs, Weakly similar to A47582 B-cell gr 








AW799279 




gb:RC0-UM0051-210300-01 2-ti06 UU0051 Homo 










Hs.128501 














gb.7e35d09.r1 SlratEgene lung (9372)0) H 


1.56 


264 




NNL018949 


Hs.1 92720 










AA654320 




Homo sapiens cDNA FLJ12304 fis, clone MA 




in 




AW854339 




hypothetbal protein FLJ11937 






406584 










2.27 




AA256374 








1.37 












l!l6 


422132 


ABO02337 


Hs.1 12078 


KIAA0339 gene prodilcl 


l]08 






Hs.197354 




1.19 


200 




AA364133 


Hs.323443 


hypothetical protein FIJ11806 










Hs.272009 


ESTs, Weakly similar to ALU2_HUHilAN ALU S 


1.48 


246 




AK0021 85 




hypolhetical protein FLJ11323~ 










H5.85289 


CD34 antigen 


2!l6 


1J6 




A1459490 




Homo salens cDNA FLJ13S9S fis, clone PL 


1.36 


241 




AA768756 








205 






Hs.237356 




234 


1.36 




AV660906 












AA457367 






1.38 






AW409763 


Hs.50699 


ESTs, Weakly similar to 2109260A B ceil 




2!34 
















AA879470 


Hs.96849 


Homo sapiens cDNA FLJ1 1492 clone HE 




2.60 




AI806770 


Hs.1 2549 
Hs.30258 


ESTs, Weakly similar to 21 09260A B cell 




2.36 
















AL031591 






l!28 


224 












258 








Homo sapiens cDNA FLJ1 1632 lis, done HE 






433282 


BE539101 






o!33 


0.20 




NM 001928 








095 




AW979136 


H3.124629 










AA 134489 


Hs.269379 




l!52 


207 




AW849241 




gb:iL3-CT0215-210200-088-E03 CT0215 Homo 








AA016982 


















U2 


2^28 




BE502240 




bypoliietlcal protein FLJ23342 


1.46 


1.40 




A1797619 
























AW590680 




von Wiliebrand factor 


172 


254 
















AA491286 






liso 


208 
















BE066552 




gb;RC3-BT0333.30030()-017-li08 BT0333Homo 




263 




AW005376 






1^20 


0.98 






Hs,28788 














hypotlietical protein FLJ10S40 










Hs.140002 


ESTs Ivloderateiy similar to ALU7 HUt\ilANA 






















Hs,347520 


gb:yj41 a1 1 .r1 Scares placenta Nb2HP Homo 






441167 


AA921754 


Hs'.2117B1 


ESTs 


1^74 


2.12 


465917 


BE 156765 




gb:RC1-HT0370-120100-012-c09 HT0370 Homo 


1.29 




419058 


AW675039 


H5,1227 


amlnoievuilnate, delta-, dehydratase 


2.04 


1,63 


408651 


BE266928 


Hs,17126 


hypothetical protein MGC15912 


1.30 




442737 


AB002319 


H5,B663 


KIAA0321 protein 


0.85 


0,79 




T51588 




gb:yb27e06.s1 Stratagene fetai spleen (9 


1.23 


0,97 


447492 




Hs,20188 


ESTs 


1.26 


2.28 


437840 


AA884836 


Hs,292014 


ESTs 


205 


229 


412294 


AA689219 


Hs,117176 


poly(A)-binding protein, nuclear 1 


1.47 


3.56 


419909 


AL1 36553 


Hs,93675 


decidual protein induced by progesterone 




218 


432569 


A1131140 


H5.152434 


ESTS 


1.34 




412252 


AW903782 




gb;CI^NN103M90400-527-g09 NN1032 Homo 


1.26 


2,00 


444298 


Z1737Q 




gb:HSDH1i020 Stratagene cDNA ilbraiy Hum 


1.36 


268 


445261 


n9759 


Hs.250651 


ESTs, Weakly similar to 138022 hypotheti 


0.96 


223 


418315 


T0B475 


Hs.124962 


Homo sapiens, clone iMAGE:3510191, ml^, 


0.82 


0.82 



185 



wo U3/003906 



PCT/US02/21338 



gb:PMl-DT0064-231299-002-c02 DT0054 Homo 



433762 
418156 
409282 
425169 
458497 
406673 
442691 
424316 
444608 
447345 



403214 
404495 
443471 
437116 
461357 



407140 
429187 
423614 
429073 



402654 
457974 
405340 
426259 
442237 
456370 
407041 



BE247767 
AW979249 
D42046 



AL049253 

AB020640 

AW807321 

AI597806 

AI628633 

AA059106 

AA447648 

AI4S7640 

AA44B167 

AA187566 

U55984 

AF001548 

BE244332 

AW842353 



gb:EST3785S4 MAGE resequences, MAS Homo 



ESTs 

KIAA0870 protein 

gb:EST391359 MAGE resequences, MAGP Homo 
DNA2 (DMA repIcaSon hellcase, yeast, li 
NM 016232*:Homo sapiens Interleuliin 1 re 
C8001441*:gi|89230B1|ret|NP_0601 14.1| tiy 
Homo sapiens ctone FLB3442 PRO0872 mRNA, 
ESTs 



gb;ty77d05.x1 Na.CGAPJ<id11 Homo sapien 
ESTs, Wealdy simiiar to i38022 iiypoUietl 
ESTs, Wealdy simiiar to S65657 alpiia-IC- 



ESTs, Weakly similar lo 2109260A B ceii 
heat sliockSOkD protein 1,alplia 
myosin, heavy polypeptide 1 1 , smoolii mus 
adaptor-related protein complex 3, mu 2 
Target Exon 

ESTs, Weakly similar lo S22765 iieterogen 
C2O02952.'gi|t345964|sp|Pl0O79|rBP1_STRPU 
ankylosis, progressive (mouse) liomdog 
ESTs, Weakly similar to KBF3_HLMAN NUCLE 
ESTs 

gb;Human pTR2 mRNA lor repetitive sequen 



404418 
447658 
434414 
400834 
449542 
441043 



liypotheiical protein FU10477 



416321 
426789 
443679 
428554 
401890 
419501 
457096 
426123 
449445 
430683 



457269 
427877 
454631 



408855 
442151 
412708 
417262 
419362 



Hs.213424 ESTs 



Hs.192104 ESTs 



gb;tr34b07.x1 NCLCGAP_Ow23 Homo sapiens 
HU 002240':Honio sapiens potassium inward 
ESTs, Weakly similar lo 138022 iiypolheli 



AI631602 
D63477 
F06596 
AK001810 



AC0C4662 
Ai744995 
AW875g72 
AW013915 



/VI 38725 

AI792585 

AA700856 

AA399964 

T83061 

AI733404 



C3001164*:gi|1730196|sp|P50573|GAR3_RAT 
ESTs 

KIAA0143 protein 

Homo sapiens cDI* FU11812 fis, clone HE 

hypothetical protein FUiag48 

ESTs 

gb;CM4-CN0046-01020O-514-IO8 CN0045 Homo 
ESTs, Weakly simiiar to cerebroside sulf 
gb:EST82246 Prostate gland I Homo sapien 
ESTs 

Homo sapiens PAC clone RP4-697H17 fram 7 
ESTs, Moderately similar to ALU4_HUMAN A 
gb:CM3-PT0014-071299-061-b05 PT0014 Homo 



gb;IL3-ST0141-131099.017-A02 ST0141 Home 

ESTs, Weakly similar to ALUC.HUMAN III! 
ESTs, Weakly similar to 178885 serineAh 
ESTs 

Homo sapiens mRNA lor KiAA1727 protein. 
ESTs 

ESTs, weakly similar lo CGHU7L collagen 
ESTs, Moderately simiiar to B34087 hypot 
hypothetcai protein FU2ig45 



wo U3/003906 



PCT/US02/21338 



50 
55 



447248 
415622 
414065 
414586 
443197 
428266 
447083 
412302 
445555 
453117 
436757 

430657 
438744 
439326 
438117 
401686 
420269 
434288 
433215 
413429 
426417 
413882 



459645 
449691 
425965 
437272 



Z43613 
AI382001 
AI472124 
AW936334 



AA482910 
BE314727 
AF086139 
A/\328041 



AI878982 
AA074346 
AA002143 



AW01SS88 
AW235215 
FSB773 



Homo sapiens cDNA FLJ13580 lis, cione PL 
liypotlietical protein MGC16291 
gb:HSC1GD091 normalized infant brail cDN 



;.1 67757 ESTs 



gb:QV4-DT0021-281299-070-g05 DTQ021 Homo 



Hs.104203 
Hs.293404 
Hs.291065 
Hs.279664 



Hs.1 47221 

Hs.206312 

Hs.22247 

Hs.250585 

Hs.336224 

Hs.124183 

Hs.65907 

Hs.159282 

Hs.1 29875 

Hs.306508 

Hs.30974 

Hs.1 29827 

Hs.6653 

Hs.131859 

Hs.250715 

Hs.21413 

Hs.248549 

Hs.1 66975 
Hs.103853 
Hs.11637 
Hs.146924 
Hs.84264 
Hs.137232 
HS.1614S 
Hs.268883 



408366 AW5112SS 
441359 Ai435179 
413068 
441322 
409124 
432413 
426391 
443861 
464609 
426893 
443611 NIVl_014397 
410369 R38624 
406308 

432476 T94344 
436073 AA654078 
420681 AA278459 



profilin 1 

Hs.1 60423 cyolin-dependeni Idnase 9 (CDC2-related 
Hs.194329 liypotlietical protein FLJ21174 

NM_01458r:Homo sapiens SKi (sex detenni 
Hs.96264 alpha lhalassemla/msntai retardation syn 
Hs.1 16265 MilnS 
Hs.191098 hypothetical protein FUliSSS 
Hs.278881 
HS.132S4 
Hs.184492 



ESTs 
ESTs 
ESTs 

gb:zi24h06.r1 Soares_pr8gnanLuterus_NbH 

ESTs 

ESTs, Weakly similar to 138022 hypotheti 

ESTs 

ESTs 

transmembrane protein with EGF-llkeand 
ESTs 

Homo sapiens, done IMAGE:3959816, mRNA, 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp76201415 (t 



Homo sapiens F-box protein FBX1 1 mF!NA, p 

ESTs 

solute carrier family 12, (polassiunvchl 
ESTs, Moderately similar to S65657 alpha 
gb;EST388066 MAGE resequences, MAGN Homo 
splicing factor, arginine/serine-rich 5 
hypothetical protein FLJ20043 
Homo sapiens mRNA; cDNA DKFZp547J125 (li' 
ESTs 



ilicproteir 

ESTs, Weakly similar to S656S7 alpha-1 C- 



Hs.1 30628 
Hs.50727 
Hs.274505 
Hs.1 60672 
Hs.134743 



gb-.QV3-Br0295-260100-066-d06BT0295Honio 



Hs.326263 ESTs 



gb:MR4-ST012M21199-017-d08 ST0125 Homi 
gb;ad43b07.s1 Stratagene lung carcinoma 
NIMA (never in mitosis gene a)-relaled k 

ESTs 

NM_026192;Homo sapiens hypothetloai prot 



Hs.151940 ESTs 



gb;ac04a05.s1 Stratagene lung (937210) H 



412149 
455116 
449626 
410047 



AW857271 
AA774247 
Ai167810 



gb:CMO-CT0307-210100-158i|09 CT0307 Homo 

zinc rmger protein 258 

zinc linger protein 36 (KOX 18) 



wo U3/003906 



PCT/US02/21338 



50 
55 

60 
65 



41BB65 


AW117500 


Hs.104241 


ESTs 


1.58 


2.63 


402762 






ENSP00000235171*:GAP luncHon beM pro 


0.81 


0.82 


436449 


AI418027 


Hs.120351 


ESTs 


1.46 


1.46 


403488 






ENSP000Q0201948:KARYOPHERIN BETA2B HOMOL 


1.3B 


Z23 


431235 


M318271 


Hs.250905 


hypothetical protein 




2.55 


448676 


AB026730 


Hs.21495 


UDP-GalibetaGlcNAc beta 1 ,3^alacto5yltr 


0.70 


0.78 


408100 


AW205382 


Hs.42676 


KIAA0781 protein 


1.36 


2.66 


433436 


AW162474 




Bruno (Drosophila) -like 6. RNA Nnding 


1.50 


2.15 


422337 


R38572 




gb:yc87c11,s1 Soares Infant bran 1NIB H 


2.23 




426160 


AA206020 


Hs.1 67460 


splicing factor, arglnlne/serine-rlch 3 


1.08 


209 


447008 


BE010189 




nuclear receptor subfamily 1, group i, m 


1.26 


1.27 


420141 


AA702961 


Hs.124103 


ESTs, Weakly similar to 133344 titin, ca 




260 


423840 


AA332434 


Hs.72465 


ESTs, Weakly similar to non-lens beta ga 


1.26 


2,47 


447793 




Hs.211203 


ESTs 


2.38 


1,83 


407328 


AA508857 


Hs.187748 


ESTs, Weakly similar to ALU1 HUMAN ALUS 




2,64 


432451 




Hs.292471 


ESTs, Weakly similar to ALU1 HUMAN ALUS 




2,05 


421311 


N71848 


Hs.283609 


hypothetical protein PRO2032 


0.51 


0,44 


444649 


AW207523 


Hs.197628 


ESTs 




2,24 


448683 


R94570 


Hs.266869 


ESTs, Weakly simlar to ALU1_HUMAN ALU S 


1.91 


3,25 


428847 


AI954833 


Hs.98881 


ESTs 


1.48 




413750 


B El 61 453 




gb:IL2-HT0437-290200.045-A06 HT0437 Homo 


1.22 




429355 




Hs.292689 


ESTs 


1.86 


2,35 


427793 


AA412499 


Hs.104779 


ESTs 


1.82 




431179 


A1338644 


Hs.195432 


aldeliyde dehydrogenase 2 Family (mltocho 


0.80 


2.00 


451719 


AI373532 


H5.157910 




1.29 


3.85 


438094 


A1821755 


Hs.131805 


ESTs, W^ltly similar to A561 94 Ihromboxa 


1.74 


2.54 


418504 


BE159ri8 


Hs.85335 


Homo sapiens mRNA; cDNA DKFZp564D1462 (f 


0.62 


0.49 


407414 


AF072164 




gb:Homo sapiens HSFE-1 mRNA, partial cds 


1,67 


2.28 


416410 


H53777 


Hs.36822 


ESTs 


1.85 


2,28 


439141 


A1241470 


H5.268982 


ESTs 


1.08 


2.28 


441181 




Hs.121076 


peptidylprolyl isomerase (cycloplillln)-l 




202 


434482 


AF143331 


Hs.16073 


ESTs 


1.22 


ZOO 


455757 






gb:RC5-BT0624-24030O-013-D08 BT0624 Homo 


1.53 


Z16 


425787 


AA363867 


Hs.155029 


ESTs 




213 


405727 






CX001244:g!|11420428|reflXP_004814.1| be 


1.70 


2.21 


441846 


AW850980 




gb:lL3-CT0220-150200-06B-B03 CT0220 Homo 




214 


461945 


BE504055 


Hs-211420 


ESTs 


0.84 




438432 


AW444990 


Hs,258800 


ESTs, Weakly similar to 138022 hypothetl 


1,60 


2.43 


461140 


AW411354 




LIU domain binding 1 




1.20 


407341 


AA918886 


Hs.204918 


ESTs, Weakly similar to ALU8.HUMAN ALU S 


1.03 


2.42 


453041 


AI680737 


Hs.289068 


Homo sapiens cDNA FU1191B fs, clone HE 


1.69 


3.43 


437613 


R19892 


Hs,10267 


MiLI protein 


1.16 




461607 


AW291109 


Hs,208787 


ESTs, Weakly similar to T31 61 1 tiypotheti 


1,22 


205 


430259 


BE550182 


Hs,127826 


RalGEF-IIke protein 3, mouse homdog 




1.00 


453669 


AL049029 




hypoltetlcal protein FU22021 


0,75 




455066 






gb:RC3-CT0255-200100-024-g10 CT0256 Homo 




220 




AL037800 


H5.814e 


selenoprotein T 


0,60 


0.18 


437936 




Hs.288549 


hypolhetteal protein FLJ14710 




244 


442556 


AL1 37761 


Hs,837g 


Homo sapiens mRNA; cDNA DKFZp586L2424 (f 


0,54 


0.37 


405223 






Target Exon 


1,09 


280 


437225 


AW975982 


H5,292935 






247 


421101 


AF010446 


Hs.101840 


mgor histocompatiblilty complex, class 


0,72 




436200 


R513B6 


Hs.124881 


ESTs 


1.64 


293 


402025 






Nk4_021624:Homo sapiens histamine H4 rece 


1.52 


228 


407019 


U49973 




gb:Human Tiggerl transposaUe element, c 


2.40 


212 


451305 


AW003571 


H5.211191 


ESTs, Weakly similar to A4601 0 X-llnked 


1.24 


a23 


423450 


AJ290445 


Hs.128759 


KIAA0524 protein 


1.64 


213 


423139 


AW402726 


Hs,288560 


hypothetical protein FU21 106 




228 


451763 


AW294647 


Hs,233634 


hypothetical protein FLJ14220 


1,39 


208 


468915 


AI915689 


Hs.212781 


EST 


1.62 


202 


452829 


A1955579 




ESTs, Weakly similar lo TRHY.HUMAN TRICH 


0.60 


0.41 


446383 


T05816 


Hs!92511 


ESTs 


2.08 


1,48 


432576 




Hs,165954 


ESTs, Weal<ly similar to 138022 hypothetl 


1.88 


249 


433820 


AI401627 












AA844700 


Hs!39297 


ESTs, Moderately similar lo ALU1 HUMAN A 




2.00 


416868 


H06728 


Hs,21017 


ESTs 


134 




420738 


NM_004185 


Hs.268575 


wingiess-type MMTV integration site fami 


1.42 








Hs.15562 


hypothetical protein FLJ10871 


0.79 


o!78 


404167 






m 021 966':Homo sapiens glutamaterecept 


1.62 


255 


417074 


Z49878 


Hs,81131 


guanidinoacetate N-methyltransferase 


0.72 


0.76 


401215 






C1200045r:gi|7512178|pir||T30337 polypr 




208 


421600 


AW893889 


Hs.323231 


Homosapiens cDNA FU11946 lis, clone HE 


188 


266 


426248 


T1S933 


Hs.293668 


ESTs 




3.44 


464623 


AW803980 




gb:PMO-UMOOB4-240300-001-G11 UM0084Hoiiio 


l!34 


2.40 


420656 


AA279098 


Hs.1 87636 


ESTs 




2.43 


402833 






C100250B;gi|6691937|emb|CAB65797.1|(AL0 


l'.31 


200 


438910 


AA827921 


Hs.291858 


ESTs, Wealdy similar to ALUC_HUMAN 1111 


1.39 


3.13 



wo U3/003906 



PCT/US02/21338 



416170 
433598 
417699 



Hs.271433 ESTs,ModeratelysimllartoAUJ2.HUMANA 



429791 
438695 
458139 
413035 
422444 
409546 
411432 



55 

60 



AL045773 

R10799 Hs.191990 

AL119206 Hs.126257 

N77624 Hs.173717 

AF083118 Hs.283968 

A1908774 Hs,259785 

AI148976 Hs.1 12062 
AW984479 

AL040832 Hs.l 60422 

AF227131 Hs.272387 

AW015667 Hs.119427 

A1885190 Hs.1 56089 

A1525711 Hs,253147 
BE1SSS63 
AA310688 

AW410190 Hs,250624 
AW846272 

AI220082 Hs.147722 

AB011136 Hs.151385 

AW34070B Hs.317714 
BE081731 

AA316987 Hs.129846 

AI627478 Hs.1 87670 

AA112090 Hs.269961 

D26361 Hs.3104 

AA766178 Hs.291601 

AA244000 Hs.222365 

AA968459 Hs.1 53785 
AF033579 

AA166653 Hs.268171 
A19Q3765 

NM_012098 Hs,8025 

AW935411 Hs.314460 

AA233094 Hs.191517 

BE549905 Hs.231754 
AL1 37967 

AA302741 HS.257B6 



403197 
432407 
414996 
401016 



AA381797 
A1734179 
AIJ044659 



AA684134 
BE244127 

437722 AW292g47 

452277 AL049013 

425712 AA412548 

427598 AA406057 

412565 M85975 

422043 AL1 33649 

421814 L12350 

413645 AA1 30992 

435563 AF210317 

462396 HI 0302 

440612 BE561384 

454721 AW815588 

417796 AA206141 

432864 D15217 

454480 AA088375 



418797 
403871 
441283 
442250 
466747 



AA927670 
AW290871 
AL037357 
AA3B3171 



gb:DKFZp434F246_r1 434 (synonym; lilBs3) 

ESTs 

ESTs, Weakly similar to AUJ1_HUMAN ALU S 
phosphatidic acid phosphatase lype 2B 
Homo sapiens CATX-2 mRNA, complete cds 
carnlline palmitoyltransterase 1, liver 
ESTs 

gb;PM1-HNC012-220300-001-b12 HN0012 Homo 
Homo sapiens clone PP902 unknown mRNA 
tasle receptor, lype 2, member 4 
ESTs 

ESTs, Weakly similar to repressor prolel 
ESTs 

gb:PM4-HT0352-171199-001-C05 HT0352 Homo 
gb;EST181501 JurkalT-cellsVHomosapie 
hypothetical protein MGC4473 
gb;QV0-CT01 79-300999^24^12 CT0179 Homo 



gb:QV2-BT0635-220400-158<04 BT0635 Homo 



Hs.281121 
Hs.105676 
Hs.43791 



C6000989':gi|7573285|ent|CAB87644.1|(AL 
ESTs 

gb;DKFZp761D231£Lr1 761 (synonym; hamy2) 
ESTs, Moderately similar fc)JCS238galac 

Target Exon 

gb:MR3-ST0191-020200-207-d04ST0191 Homo 



C2002793*:gi|135314B|sp|Q0956a|YR86_CAEE 
gb;zr03n 2.r1 Stratagene NT2 neuronal pr 
hypothetical protein FU2193S 
ENSPC0000227126;NAALADASEli PROTEIN. 
ESTs 

gb:TCBAP1E0661 Pediatric pre-Bcell aout 
ESTs, WfeaMy similar toJU0033hypotheb- 
KIAA1223 protein 

ESTs, bkxierately similar to ALU1.HUMAN A 

ESTs 

gb;EST02500 Fetal brain, Stratagene (cat 

retinoic acid induced 1 
thrombospondin 2 

gb:zo15e02,s1 Stratagene coton (937204) 
solute carrier family 2 (facilitated glu 

ESTs 



Hs.279607 
H5,22612 
Hs.15246 



Hs.131704 
Hs.129121 
HS.12S864 



wo U3/003906 PCT/US02/21338 





M4520e8 




Opannteracting protein 2 


1.26 






AK0CQ156 


Hs,272193 


Homo sapiens cDNA FLI20149 lis, done CO 


1.46 


2.60 




AI733852 






1.62 


210 




AA694070 


Hs,268836 






253 




AA808027 


Hs.123277 




US 






AI824879 




ESTs, Wealily simiiar to 1 207289A reverse 








AW936300 




gb:Q\/4-DT0021-281299-070-a04DT0021 Homo 








AA679082 




hypothetical protein DKFZpS64K192 




ao8 




AW206285 


Hs, 253548 












Hs.168812 


Homo sapiens cDNA FLJ14132 ils, clone MA 


















404031 


AW613780 


Hs,13500 


ESTs 

C6001700*:gi|9256616|re11NP 051761.1| pr 




o!l7 
2.29 


457412 


N40711 


H5.333300 


hypotlieticai protein FU14026" 




3.20 






Hs,326048 


Homo sapiens mRNA; cDNA DKFZp434U0420 (1 


162 


230 
















AVV954011 






0^92 


220 








Homo sapiens, done MGC: 17421, mRNA, com 










HS'302634 










AI698091 


Hs.107B45 






2,25 








Homo sapiens mRNA; cDHA IDKFZp434N011 (fr 






410357 






gb:hj22e04,x1 NCI.CGAP.LIB Homo sapiens 


a69 


a59 








gb;CM0-CTflQ52-150799-024-c04 CT0052 Homo 








NM 014923 


Hs-103329 














gb:EST382704 MAGE resequences, MAGK Homo 




















BE299590 




gb:QV4-ST0023-160400-1724i10 ST0023 Homo 




















A1674710 








205 




AW439340 












A1051603 












AI698138 


H5,126918 




l!40 


2.36 




AI984203 






0.88 


216 




AW811114 




gb;MR2-ST0131-111199-016-a04ST0131 Homo 


1X1 


231 




A1936264 








2^45 


401908 






CI 70001 54;gi|12003980|gb|AAG43830.1|AF21 




2.28 


404730 


























AA158617 














Hs.307972 






















AF083130 




Homo sapiens CATX-1 4 mRNA, partial cds 




1.55 




AI696^4 






132 


238 




AI579909 








0.27 




AA860329 




hypothellcal protein DKFZp434i2117 














1,09 


218 
















BE088433 


Hs,334696 




1.40 


250 
























100 
















AL137445 




Homo sapiens mRNA; oDKA DKFZp5660134 (fr 












ESTs. MGde'alBlyslmiilarto 2109260ABC 






















































440781 


BE561823 


Hs.281434 


Homo sapiens cDNA FLJ14028 fis, clone HE 


128 


2J0 


442662 


U78168 


Hs!8578 


Rapl guanine-nucieollde-exchange factor 


1.92 


228 


443078 


M78728 


Hs.1 32694 


Homo sapiens cDNA: FU2314g fis, clone L 


1.42 


203 


440179 


AI990151 


Hs.1 25904 


ESTs 


1.49 


2.63 


446780 


R31107 




gb:yh61g01.5l Scares placenta Nb2HP Homo 


1.96 


2.78 




AI126432 


Hs.149493 


ESTs 




210 


417939 


R53863 


Hs.337512 


ESTs, Weakly similar to ALUS HUI*IN!!!! 


160 


203 


428490 


BE30173B 


Hs.49806 


ESTs, Weakly similar to A46010 X-llnked 


0.47 


0.44 


443869 


AI141520 


Hs,151464 


ESTs, Weakly similar to ALUC HUlWl !!!! 


1.26 


2.68 


426322 


J05068 


HS.2C12 


transcobalamln i (vitamin B12 binding pr 


212 


1.15 


411630 


U42349 


Hs.71119 


Putative prostate cancer tumor suppresso 




0.48 


454701 


AW854930 




gb;PM0CT0263-201099-003-fD6 CT0263 Homo 


130 


233 


439795 


N77294 


Hs.194294 


ESTs 


1.17 


233 


425546 


BE409762 


Hs,26118 


hypothetical protein I^GCI 3033 


1.17 


285 


411245 


AWB33441 




gb;QV4-n0008-271099-020^1 n0008 Homo 


1.90 


3,98 



190 



wo 03/003906 



PCT/LS02/21338 



434967 
425724 
446847 
453216 
421718 
415924 
450850 
443153 
434420 
426126 
421926 
459563 
453006 
437223 
417016 



416640 
403146 
457397 
439189 
423969 



431531 
441077 
426799 
419480 
455908 
403332 
455753 
404429 
438941 
428745 
411567 
458714 
426839 
444539 
407322 
453826 



AF283775 Hs,35380 
AA362525 

T51454 Hs,82845 

AL137566 Hs,32405 
AL1 17574 

HI 8047 Hs,335821 

AA648886 Hs,151999 

A1371823 Hs.34079 

AA688278 Hs,194864 

ALII 8747 Hs,26691 
AA300591 




gb:EST72223 Namalwa B cells 1 Homo sapie 
Homo sapiens cDNA: FU21930 fis, clone H 
Homo sapiens mRNA;cDNADKFZp586G0321 (f 
Homo sapiens mRNA; cDNA DKFZp434L2221 (f 



hypolliellcal protein FLJ22578 



Homo sapiens cDNA FU1436B fIs, done HE 
ESTs 

ribosonnal protein L4 

gb:EST63098 Jurkat T-cells V Homo saplen 
NfcL014115*:Homo sapiens PRO01 13 pralein 



449832 AA694254 



AA521381 Hs.187726 
BE142052 Hs.62654 
AI241273 Hs.15312 



AW969025 

AI961186 Hs.144630 

AI830671 Hs,34969 

A1674906 Hs.199460 

AK001B90 HS.1726S4 

BE62Z743 H3.301064 

N93047 Hs.19131 

AW811978 Hs.254037 

AAg09619 H3.112668 

NS6605 Hs.269aS3 

U41514 Hs.80120 

AL119470 H3.145631 

AW958D42 Hs.95870 
U04691 
AW813786 

H01613 Hs.50628 

AW631111 Hs,249727 

AW238410 Hs,253888 



KIAA1806 protein 
hypotheHcat protein K1AA1165 
gb7j45c03.r1 Soares placenta Nb2HP Homo 



ESTs, Moderateiif similar to 16091958 bio 
aldehyde oxidase 1 
oilron (rho-inleraoUng, se 



nuclear receptor subfamily 2, group F, m 
hypothelicai protein Di<FZp566N034 
gb™c73l02.x1 NCI_CGAP_Pan1 Homo sap 



Hs.60049 ESTs 



transcription facta Dp-2 (E2F dimerizat 



UDP-N-acetyl-alpha-D^alactosamlne:polyp 
PTDOIS protein 

gb:Hunnan olfactory receptor (0R1 7-219) g 
gb:RC3-ST0197-120200-015-b05 ST0197Homo 
adaptor-related pratein complex 4, sIgma 
gb:hh92e12.y1 NCI_CGAP_GU1 Homo sapiens 

ESTs 



BE075124 

AF075047 Hs.31864 

AA433896 Hs.201634 
AW851630 

R20916 Hs.344777 

M74782 Hs.1 72689 

Ai955765 Hs.146907 

AA171892 Hs.324570 
AL13B129 

AA694324 Hs.257e75 



kringle-conlaining transmembrane protein 
ESTs 

popeye prateiu 3 
hypothelicai protein FU23017 
gb:QV0-HT0357-15O200-114-h04 HTC367 Homo 

Target Exon 

gb:PM1-BT0585-1 10200-003-h02 BT0585 Homo 

Targel Exon 

ESTs 

ESTs 

gb:MR2-CT0222-211099-002-h05CT0222Homo 
ESTs 

Interteukin 3 receptor, alpha (tow afSn 
ESTs, Wealdy simitar to 2004399A ohromos 
ESTs, wealdy similar to ALU3_HUMAN ALU S 
gb:Di<FZp547F152_r1 547 (synonym: Hfbrl) 



wo U3/003906 



PCT/US02/21338 













3.08 


417759 


R13567 


Hs.12548 


ESTs 


163 


258 


417527 


AA203524 




gb:zx56e10.r1 Soaresjetaljiverjspleen. 


1.52 


2.02 




M405062 


Hs.345830 


gb'.zu12e04,r1 Soares~lestts NHTHoims^ 








AW891707 




gb;CM3-NT0090-040500-17i'-e02 NT0090 Homo 


l!l2 


2.20 








hypothefcal protein DKFZp564F013 


0.93 


2.28 




AI921875 




gb:wp07e04j(1 NCI_CGAPJ<ld12Hoinosapien 








N59555 




gb:yv76l05 j1 Soar^ fetal liver spleen 


1.61 


2.80 










1.11 


2,10 




AF072164 


Hs.1 37570 


Homo sapiens HSFE-1 mRNA, partial cds 


2.04 






MM 012331 
















gb:EST62799 Jurkat T-cells V Homo sapisn 


0^99 


2.08 




W28339 
















CI 2001 521 ;gl|7513934|pir||T31 081 coa3 pr 


230 


1 00 




A1088196 




Homo sapiens clone IMAGE:451939, mRNA se 


1.21 






AA287987 




ESTs, Weakly similar to 1 207289A reverse 


1.26 


260 
















AI468079 












W90337 




ESTs, Moderately similar to 2109260A B c 


1.56 


223 








KiAA1105 protein 








AA400352 


Hs.1 12861 




154 


Z05 






Hs.295655 


ESTs, WeEl<ly similar lo PC4259 ferritin 


1.48 


Z15 








gb.'yi51h07.r1 Scares placenta NbJHPHomo 


1.42 


Z78 




AW444662 






1.50 


2.48 




BE299567 




ESTs, liiloderately similar to ALU8_HUMAN A 


1.31 


2.25 








gb:ye20g09.s1 Stratagene iung (^210) H 


2.10 


1.69 




AI986160 


Hs.180383 


dual specificity phosphatase 6 


0.99 


2.33 








Homo sapiens cDNA: FU23573 lis, done L 


2.14 


1.61 




BE501816 






1.76 


2.83 




AW361626 




hypothetlcai protein FLJ11240 


0.41 


0.28 










0.76 


3.40 




AA281504 




Homo sapiens cDNA: FIJ22165 lis, done H 


216 


1.65 




AA442498 




ESTs, IModerateiy similar toZ19SJHUMAN Z 


1.30 


2.00 










1.87 


2.23 


403344 






HU 000341 :Homo sapiens sdute carrier fa 


1.36 




418056 


AA524886 




gb:nh34f02.s1 Na_CGAP_Pr3 Homo sapiens 


1.42 


2.85 




AI791746 


Hs.1 30293 




2,44 


1.32 




AA811657 


Hs.220913 




1.32 


2.08 




AW196772 


Hs.131323 




1.80 


2.65 




AA724771 






1.64 


2.18 




AI369787 












AI246193 






u^ 


2^83 




A1990790 


Hs.188614 




1.48 


2.45 




AB021866 




potassium voltage-gated channel, Shal-re 


1.70 








Hs.205980 




1.83 


ioA 










0.58 


0.43 








Target Bcon 


1.47 


3.75 








Target Exon 


2.00 


1.91 






Hs .326159 


leucine rich repeat (in FUi) interadln 


0.33 


0.21 


433390 


AA586950 


Hs.260180 


Homo sapiens mRNA; cDNA DKFZp761G18121 ( 
ESTs 


2.00 


4.90 


451443 


AW295527 


Hs.210303 


1.87 


2.25 


411188 


BE161168 




gb:Pi\ilO-HT0425-170100-002-a10HTa425Homo 


2.15 


1.69 


462704 


AA027823 


Hs.149424 


Homo sapiens PNAS-130 mRN^ complete cds 


2.64 


1.65 


424060 


X92108 




H^apiens mRNA forsubteiomeric repeats 


Z40 


258 




A1738815 


Hs.1 17323 


ESTs 


1.46 


210 


42862D 




Hs.1 84736 


hypotheScal protein FU10097 


0.44 




439492 


AF086310 


Hs.1 031 59 


ESTs 


0.42 


0.26 


426736 


AA431615 


Hs.1 30722 


ESTs 


1.90 


245 


416226 


AA577730 


Hs.188684 


ESTs, Weakly similar lo PC4269 lerrifin 


2.72 


6.25 


404917 






Target Exon 


1.60 


2.16 


448955 


AW207597 


Hs.28102 


ESTs 


2.08 


1.76 


402797 






Target Exon 


2.12 


1.37 


457951 


U23860 




gb:Human ctone mcag19 chromosome 16 CTG 


1.72 


200 


426982 


AA149707 




ubiquitin-like 3 


0.36 


0.17 



TABLE 8B 

Pkey; Unique Eos probeset identifier number 
CAT number: Genecluster number 
Accession: Genbank accession numbers 



AA451966 NM 016370 AB036693 AL139228 R68124 AI634847 AL1 19333 W07356 Ai334284 H29060 AI192685AA662438 AW172843 W19794 
N21460 AI743862 AW130622 AA991348 AI204553 AA992664 N80848 AA699329 Ai824676 R26624 R49653 

AW807321 AW807262 AW177104 AW807319 AW807115 AW807344 AW807324 AW178116 BE141575 AW845849 AW807105 AW845868 
BE140942 AW807178 AW807167 AW807398 AW807320 AW807306 AW845866 

192 



Ptey CAT Number 
408139 10421_1 

408266 1049361_1 



wo 03/003906 



PCT/LS02/21338 



35 
40 



413068 
413196 
413346 
413444 
413489 
413593 
413645 
413656 
413750 



414051 
414623 
415054 
415110 
415157 
415361 
415378 
415449 
415770 



416463 
417086 
417117 
417324 



110159J 
111512J 
1115745J 
1123651_1 
114831.3 
114869 1 
1157906_1 
117950J 
1197159J 
1213736J 
1218207.1 
121847_1 
1219485.1 



1227882.1 
1229027.1 
1229373.-1 
1233793.1 



1236412.1 
1239834.1 
1245636.1 

1248073 1 
1249774_1 
125064.1 



1263110.1 
1265807.1 
1265812.1 



1276467.1 
1285293.1 
128647.1 
1288098.1 



1348104.1 
135322.1 
135323.1 
1370831 1 
1373392.1 
137e91J 

1381628J 
1386250.1 
1397288.1 
139750 1 
1413185J 
1466952 1 
151827.1 
1522905.1 
1525616.1 
1534970.1 
1535274^1 
1537026 1 
155437.1 



164830.1 
1651230.1 
166714.1 



BE141579 AW807556 AW807502 BE141596 AWB45846 AW807600 AW845B64 AW807480 AW807486 AW807478 AW178109 AW807228 

AW807374AW807125AW846124AW807470AW807477AW807510AW807208BE141573AW807465 

AW277084 R27662 R26970 D791 94 

AAa63426 AW962323 AW408063 AA063503 AA772927 AW753492 BE175371 AA31 1 147 

AW966480 AA069840 AA38464B 

AW373472 AW373484 BE071899 BE071898 

AW382767 BE153e35 BE163702 BE153572 

Al 1 50485 AW938392 AA076894 AW883422 

AI500724 AA399661 AA397B91 AI471084 A1423511 T07531 AIC94336 

BE081731 AW861738 AW503629 BE081969 



AW663614 BE046540 BE045760 AW827443 BE046544 
AW792784 Z44444 H06639 

AW799279 AW799395 AW799392 AW799276 AW799479 H57B85 

AW368860 AA457091 A1903441 AA088B23 W88852 AW979154 AA826016 R94779 

H86002 W92289 AW801558 AW801324 AW801270 AW801307 AW801351 AW8013S7 AW801299 AW801609 AW801356 AWB01420 
AW80142S AW801358 AW801429 AW801428 AWB01427 AW801424 AW801306 

AW806924 AW866537 AW866473 AW866298 AWB66390 AW866478 AW866454 AW866309 AW866539 AW866521 AW866547 AW866517 
AW866403AW86636g 



AW813238AW813474AW813334AW8ie0B1AWB13296AW813363AW81 3397 AW813327AW81 3328 AWB16031 AW816140 
AW813788 

AW819561 AW819682AW819563AW8196BBAW819499AW819498AW819690BE065081 

AW821117 AW855641 AW855405 AW855374 

BE161168BE162466AW821260 

AW833441AW833552AW833700AW833610AW833673AW833675 

AW8381 26 AW838294 AW838247 AW838251 AWB38292 AW838299 AW83B374 

AW846272 AW846564 AW846545 AW8462B5 AWB46135 AW846317 AW846200 AW846265 AW846326 AW846196 AW846357 AW846153 
AW846286 AW846319 AW846277 AW8463B1 AWB46438 AWB46481 AW846352 
AW849241 AW849569 AW849243 

AW851630 AW851703 AW851735 AW851723 AWB51708 AW851712 
T95183T64070AA094134 

AW853441 BE145228 BE145218 BE145162 BE145283 
AW855393 AW855660 AW855424 
AW872477 BE088101 T0S990 

AW876354 AW876179 AW876318 AW876290 AW876234 AW876125 AW876199 AW8761 98 

AW876263 AW876257 AW876261 AW876273 AW876231 AW876398 AW876334 AW8761 34 AWB7S371 AW876308 AW876314 AW876328 
AW876 1 69 AW876331 AW876426 AW876407 AW87B412 AW876322 AW876363 AW876218 AW876240 AW8761 41 AW8761 38 AW876325 
AW876181 AW876131 AW876378 AW876206AW876191 AW876188AW876376AW876289AW876415 AW876209 AW876366 AW876343 
AWe76347AW876165 ' 



AW936334 AW936371 AWg36474 
BE1447B2AW979091 

BE008018 BE008025 BE008026 BE007959 BE007994 BE00B016 BE00B019 BE00B024 BE008022 BE00B027 BE008029 1 
BE008021 BE008028 BE008023 BE008030 BE008014 



BE141019BE141473BE141529 

BE144228 BE144291 

AA205248AA1 30658 AA204737 

AA130992 AA503835 AW96g537 

T91703 BE165222 BE155240 BE155274 BE155275 T92469 

BE161453W23803 

BE177442 BE177439 BE177445 BE177440 BE177448 BE177444 BE177433 
AA6802B8 AA315655 AA133031 AA377748 
BE244127BE246216 
BE391050BE389191 BE389697 
AI733907 AA159708 AI732614 
H04043 D60988 D60337 
D63257 D78918 D63214 
F06724Z43690 R21681 
T16964 F07075H10256 
H15034T17195 F09069 
M79237A1: 



AI921875AA170835AA866613 

AI267700 A1720344 AA191424 AI023543 A1469633 AA172056 AW958465 AA172236 AW953397 AA355086 
D80529 D81719 C14833 AA179446 AA357794 
H69241 H57540 R00795 

AA19444B AA194603 AA193162 AA196396 Z24810 
N46778 W8B339 N75221 

AW265494 AA455904AA1 95677 AW265432 AW9916a5 AA4S6370 
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wo 03/003906 PCT/LS02/21338 



419145 
419501 



422269 
422297 
422337 
422444 
422857 
422936 
422940 
422959 



423074 
423622 
3 5 423736 
423977 
424060 



45 425724 
425757 
425893 
425980 
426123 

50 426477 



214851_1 
215447.1 

216695_1 
222190_1 



249780_1 
255206J 
2555091.1 
255956J 
25774?_1 
25877811 
261304J 
267804J 



55 



70 
75 



R79048 R23111 

AA524886 AW971347 AA211537 
AA515814 M515037 AA230024AA228343 

N99638 AW973750 AA328271 H90994 M55802Q AA234435 N59599 R94815 
AA236867 AA237066 AA354236 AW957759 H08961 
AW843822 AW844020 AA825923 AW968657 AW854173 BE084673 
AA287443 AA419385 BE084078 AI478347 

A1910275 X00474 X52003 X05030 NM 003225 AA314326 AA308400 AA506787 AA314825 AI571948 AA507596 AA614579 AA587613 R83818 

AA568312 AA614409 AA307578 A1925552 AW950155 AI910083 M12076 BE074052 AW004668 AA578674 AA582084 BE074053 BE074126 

BE074140 AA514776 AA588034 BE074051 BE074068 AW009769 AW050690 AA858276 R55389 AI001051 AW050700 AW75021 6 AA614539 

BE074046 A1307407 AW6a2303 BE073575 AI202532 AA524242 AI970839 AI909751 BE076078 AI909749 R55292 

AL1 1 7674 A16B1 945 BE503055 AW015330 CI 6652 AA441 856 AA329863 AW962602 AA441 830 AF1 14049 

AA300691AW963893AA300493 

AA307684 AW795791 AW795790 

AW961290AA308338 R60841 BE378716R13001 

R38572 R49597 H51730 F1 0468 AA3091 98 BE011889 AA61 3236 

AA31068BAA355321 AW962134 

R7)461 AA328744AA31812BBE081817 BE081587BE081751 

AA319278AA319712AW961604AW954g66 

BE077458 AA337277 AA3192B6 

AV647015AV647162AA477047AW392066BE168052AA657684AL040747AL043689AL041618W24630AA419002AA778650AI028269 
AW378085 AW780241 A1984255 AW129462 AI271476 AU)40967 AA836991 AL04091 0 AW273346 AA1 32452 AI375545 AI436321 N53166 
AA598667 AA423804 AA889495 A1954063 A1923968 AW1 94576 AI4O0578 AA748499 AA86C230 AW519209 AA339537 AA476935 AA767391 
A1017B95 AI860419 N93437 AW452389 A1051 533 AA806940 AA609569 AI624860 A1356952 AIB11 51 6 AI439 1 98 AA749268 A1206880 
AI784422 A1431 957 Al 1 87038 A1678429 A1273421 A1358604 A1691 1 55 AA586499 AA897667 AA524576 Al 41 7772 AW799936 BE241 923 
AA01 3428 BE1 54987 AW602710 AI933540 AA757772 AW015132 A1954159 AW956377 W69468 AI807580 AI871746 AI922524 N80292 
AI676213 BE4670e6 N92405 AV65631 1 A1678172 AW339455 A1949278 AI872882 A11237B5 AA01 6267 AI961623 AW168519 F28686 F37181 
AA908333 AW020788 AW374027 

AL1 09963 AAt34692 BE273642 M78295 AI825179 AA228029 BE538677 AW589948 AW241564 AA732224 AA22B030 AW977225 

BE154847BE154890AA328702 

AW936874 AW936923 AW936924 AA330274 

AA333232AW9626g5R54896 

X92108 AW295478 A176B675 AI086644 AW1 90160 AA31 3783 AA322741 AA354B98 AW962099 AL044BB7 AA282648 AA973897 AU)36586 

AA386055AL135179AW882199 

T74062 F12441 AA336207 BE185031 



AA364602 AA355215 AW962122 
AA354785AA354792AW958427 
N59555AA362113AA362136 
AA362525AW979199AA857501 
AA363171 AyV963347 AA371863 

AA629695 AA365582 AA365581 AW959799 AW876559 AW876588 
AA366951 AA470999 AA46g425 
AA370352AA370860AW962776 
AA379464 AA379611 AA379463 

AW068302 AI754558 A1750727 A1752B31 AA302174 AA327522 M64110 AW859944 AW859989 AI751995 AA7B9620 AI858829 A1924875 
AI888836AA864291 AI6B5060AW088029AI924908'AW466328 AI093800AA9gi651 AI254501 BE004703AA334442AW938852AA1 94330 
AIJ046953 AA852866 AW391995 W30846 AW66292B W25261 AA042B63 R99045 H97060 W03910 H94687 TBB964 AUMBI 65 T29632 N316S6 
N36484 AI798679 AA989355 W23832 AA873789 A1743646 AA363587 AI814748 AW338990 N73740 N83666 AL047816 R24137 R63433 
AA524984 AA234043 AA195131 N99903 AA453669 AI240302 AA370271 A1960026 AW771049 AA121476 AA569557 AI752632 A1355594 
A1471993 A1159941 N94555 A1753138 N21537 H97881 N26769 AW066044 AA808425 RB3380 AA384736 AA3B4738 AA852352 AI073645 
AA527960AA625036AA044414A1752460AA703064 R01216AA897183AI751996 181078 H95047 AA573642 D58348 N20953 AA437143 
N95439 AA579540 AW867056 AA770090 AI085180 A1806799 AA426421 AI67261 3 R24061 AA8631 89 AA295620 AA234044 AA371 020 
AW994984 H20e96 AW964438 AA318516 AA318499AA316727AA318211 AASIS-^ye AA31B444 M3183C7 AA318497 AA31B448 AA318309 
AA318496AA318213AA3ie435AA318424AA318217M318523 AA318438 AA318487 AA318724AA5931S6 AW994985 T69842 A1251Bi3 
AA478174 AA447737 T68350 F07712 AA121 145 H08973 AA345212 BE00C667 AW06821 Q AW603407 R0E674 H16712 N85426 N42354 
H85516 BE147991 T28113 R32662 AA3B4678 AW239275 'HB2382 AW84070C D58229 CC4082 W45394 AW795667 R73973 BE002409 
AA042828 AA3B3555 AJ223812AA3447C9 BE149590 R70995 W46881 WSC778 N71242 AA534326 AL040676 R23797 H96450 AA062957 
D79947 W46g60 AWg59278 AA295997 AA026215 AW579469 AW366135 AW355134 AW994363 AW972886 AW069166 AA343690 AW888731 
A175i527 AA937490 AA937506 AI826715 BE465604 AI925532 AI858109 AW339097 AI858524 AI720571 BE046506 AW3B4981 AA04390B 
AA375983AA525181 AW068366 AW070577 AW891837 N83986 AW182753 A1422979 A1679733 BE006555 AL048165AI081401 AI888821 
A1626043 N37C87 A1624140 AI801298 AA600048 AI753947 HB9B15 N66424 AW0B9756 AI814880 AI982806 AI764287AI971816 AW069022 
AW069069 AW069454 AA3429a9 AI07771 2 AI31 1467 AI087361 AI801 01 5 W46993 AI281 324 AW1 91 963 AI421 675 AI300881 AI356670 
AA873156 AI004219 AI189685 AA478018 AA076063 AI445222 A1753124 A1521569 A1925026 A1022368 A1475993 H20846 A1223234 A1635123 
AA579170 N30442 AW117889 AA807935 AA558975 AI30663B AA888963 AI952591 AI935835 AI446293 H16713 AW139833 AA622122 
AA972051 AI280B28 H09543 AI453725 AW069613 AI865615 AI753921 AI368782 AI633208 AI446651 W46961 N22201 H82276 C16555 
AA291477AW440535AW517755AA669921 AI926777AW662118AA563369 N67873AW023948 015861 C16601 A1251465AW079187 
BE045090 AI273006 CI 6390 C16503 A1620823 F13661 N66864 Z21311 C16108 CI 6089 C16400 AA758273 AI287781 AA864676 AW608074 
AW385583 AI589944 AA665817 AW192979 AW469065 AA564048 H84715 C16417 AA7ai072 AA661674 CI 6487 N29477 AW1 89997 
A1370492 C16471 AA652809 AA938B87 AA506512 CI 6306 AW028413 A1537935 AA528347 C16255 AW029046 CI 6202 A1868152 A1524662 
T94414 AI567041 A1619654 AW008486 AI075624 AAS77434 AA345104 T30105 AA932002 C16586 AI760390 AW294265 AI619552 AA669781 
AA026678 AW132002 AW263919 C16562 AA759137 AA693351 Z40779 CI 6577 AABB5045 AW073763 R45484 AI520895 U5470B T492B5 



194 



wo 03/003906 PCT/LS02/21338 



428518 
429321 
429468 
430068 
430439 
431089 
431322 
431926 
432222 
432407 



437046 

437908 
438089 
438091 



292383 1 
3C2955J 
3C498 1 
312849 1 
31808.1 

331643J 
339082J 
343347_1 
34624.1 



39406J 
399701_1 
42361 3 
43210J 
435447.1 
446001.1 
44963.1 
44964.1 



443197 
443357 
444163 
444298 



454609 
454629 
454631 
464639 
454662 
454701 
464721 



640681.1 
64334_1 
647829J 



844585J 
844617.1 
8821 30J 
922372J 



1207274J 
1221564J 
1226517J 
1227240 1 
1227443J 



AI668126AW006569A1093317AL119781 T61046A1053563 H51958AF114144AA305739AW950394AW793928AW793910AL047737 

AV659047 AV659632 A1750389 AA092053 AAC92798 H85367 T61597 R23745 Z20418 T78485 AI751528 AW068121 AA853188 AI752469 

AA853711 AW950663 R78964 R36359 R21626 R21522 

AW969656 AA501412 AA905186 AA429703 AA431958 

AA449921 AA449922 C75309 

AF033579 NM_005995 

AA464954 M854C5AA947556 

AL133561 AL041090 AL117481 AL122069 AW439292 AI968826 
B E041 395 AA491 826 AA621946 AA71 5980 AA6661 02 
AW97C622 AA503009 AA502998 AAS02989 AA602805 T92188 
AW972724AA522631 AA877998 
A1204995 AW827539 AW969908 AW440775 AA528756 

AA221036 R87170 BE537068 BE544767 C18935 AV;812056 T926e6 Ayi,227415 M233S/:2 AA223237 AA6B8403AA601627 AW869639 
BE051833 BE000620 AW961 170 AW847519 AA308542 AW82 1 633 AW9«e88 C04699 AA205504 AA377241 AW821667 AA055720 
AW817981 AW856468 AA1 55719 AA1 79928 T03007 AW754298 AA227407 AA1 1 3928 AA307904 CI 6859 
AA568548 AI050036 AA554053 AI911659 AI826259 
AW970018 AA573669 AA573622 R08736 
AW162474 AA588442 AI972440 

A1798376 S46400AW811617AWB11616W00657BE142245AW858232AW861851AW858362AA232351AA218567AA055556AW858231 

AW857541 AWai4172 H66214 AW814398 AF134164 AA243093 AA1 73345 AA199942 AA2233e4 AA227092 AA227080 T1 2379 AA092174 

T61139AA149776AA699829AW879188AW813567AW813538AI267168AA157718AA157719AA100472AA100774AA130756AA157705 

AA157730AA157715AA053624AW849581 AW854566C05254AW332336T92637AW312621 AA206533AA20g204BE156909AA226824 

AI82g30g AWg91 957 N66gsi AA527374 H6621S AA045S64 AI69426S H60B08 AA149726 AWigS620 BE081333 BE073424 AW817662 

AW817705 AW817703AW817659 BE081631 H59570 

AW852235AF156166 R07008 

AA664078 AW363313 AA805009 

AA623980AI126603 BE504035 

BE149154 BE149151 X60138 

AW975957 AA747943 AA81 1289 

AI082424 AI740586 AA771764AA771806 AI033879 BE500996 AW204531 D38676 
W05391 X56197 H75313 AW770789 AA281642 AV654440 AA344646 R00244 T83378 

AW373a62T55662AI2ggi90BE174210AW579001 H01811W40186 R67100AI923B86AW952164AA628440AW898607AW898616 
AA709126 AW898628 AW898544 AA947932 AW898625 AW898622 AI27612S AI185720 AW510698 AA987230 TS2522 BE467708 AW243400 
AW043642AI288245A1186932D62654 D56017 D62715 D52477 D53933 D54679A1298739A1146984A1922204 N98343BE174213AA845571 
AI813854 AI214518 AI635262 AI139455 AI7078C7 AISgSOBS AWBB4528 AI024768 AI004723 AWC87420 AI565133 N94964 AI268939 
AW61328aA10B1126A1436818A1859106A1360506A1024767AA513019AA757598 X56196AA902959A1334784A1860794AA010207 
AW89CC91 AW513771 A1951391 A1337671 T52499 AA890205 A1640908 H75966 AA4B3487 AA3586B8 A1961767 A1866295 AA780994 
A1985913BE174196AA029094AW592159T55581 N79072AI611201AA910812AI220713AW149306A1758412AA045713 R79750 N76096 
L09078 L03146 L09094 L09098 L03166 L09102 
M27346 

AF086410W94386 W74609 
AWg7924g D63277AA846968 
BE561384AW732707 

AI74499S AI973099 Rig291 AA93S21S AA897534 AI698070 BE179102 AI6g4912 
AW850g80AA969613AW3667g3 

AI033188 BE004743 AW804O74 BE0047gS BE178660 BE089438 BE08g378 BE089439 BE089437 BE089423 BE089420 BE08942S BE089424 

Z43613 AA32ai91 AW964963 H16889 R83872 

AW016r73 AI052778 AI452937 AW085293 

A112609BA1184746AI148521 

Z17870AL039867AI1399Z7 

R44007 N54521 A1140476 AW418649 T95308 

AI936264 AI219228 AI200740 BE300771 AI63S739 AI690626 AW044010 R60842 R40388 
AW460439 AW297340 A1239849 A16131 1 9 
AW97401 3 AA557257 N36812 AI59799B AA873565 
BE1 58869 AI248241 BE158865 BE158868 
R31107AI 341136 AI65319BH04953 

BE010189AW879041 BE008038AW905325AA343575AW844209AL038020BE010133BE010083AW903901 BE174377A1309717AI349661 
AigO3726BE176661AW90S3g4 
AI623875 R45782 R45781 
T94709AI820675AI732253 
R55428 A1820704 AI732283 R549B3 
AI732321 R66640 R56639 AI820744 
Aig03765 AI811194 BE0O7147AW130760 
BE066552 BE066341 AI907683 
AL043049T95976 
AL1 37967 OF 
AL138129AL138179BE 
AW763456AV 
AW8C3980AW803974 

AW810204 AW810555 AW810196 AW810619 AW810507 
AW811114AW811095AW811087AW811124AW811054AW811094AW811157 
AW811324AW811325AW811326AW811333AW811329AW811328AW811332AW811339AW811335 
AW81 1633 AW81 1652 AW81 1898 
AW812715AW812646AW812714 
AW854930 AW814431 AW814190 AW814115 AW854941 

AW815588AW815671AW815493AW81589BAW815613AW815490AW815808AW815426 
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455407 
455508 
455527 

455649 
455753 
455757 



457667 
457581 
457741 
457871 
457951 



TABLE 8C 



1236377J 
1242343_1 



1318507J 
1322125_1 
1348163J 

1356070J 
135B657J 
1382301.1 



179104J 
184123_1 
313764_1 



AW833676 AW833814 AW833798 AW833677 AW833449 AW833630 AW833826 AW833444 AW833366 AW833791 AW833669 AW833432 

AW833534 AW833556 AW833553 

AW841619 AW851958 AW851851 AW851986 

AW854352 AW854311 AW854340 AW854461 

AW855389 AW855556AW855420 

AW857271 AW8573Q8AWa67296 AW857258 

AW875972 AW8759a3 AWa75974 AW876000 AWB75966 AW8760S0 

AW891707AW891696AW891917AW891913AWB91912AW8919Q9AW891890AWa91892AW891B89AW891697AW891880AW891898 
AW936813 AW936731 AW936728 AW936600AW936681 AW936661 

AW976165 CQ4QQQ 

AW984479 AW984498 AW984495 AW984477 AW984480 AW984604 

BE063965 BE063968 BE084034 BE064028 BE063874 BEQB3966 BE063869 BE064043 BE064033 BE063884 
BE065051 BE155165 BE064764 BE155231 BE064648 BE064671 BE064636 
BE076124 BE076229 BE075278 

BE079531 BE079371 BE079372 BE079693 BE079468 BE079604 BE079505 
BE1S6306 BE1S6188 BE15S298 BE1S6377BE1S6374 

BE166765 BE166770 BE156767 BE156769 BE156803 BE1 56802 BE156847 BE156853 BE156780 BE156836 BE156792 BE156834 BE156779 
BE156789 BE156833 BE156B44 BE156831 BE166849 BE156797 BE166784 BE166801 BE166843 BE156793 BE156852 
BE177983 BE178322 

AW460979 AA138653AA136666 AW419381 AA984358 AA492073 BE168945AA809054 AW238038 BE011212 BE011359 BE011367 
BE011368 BE011362 BE011215 BE011366 BE011363 



AA228357 AW841786 AW84171 6 

AA236606 AA459341 AA237079 
AW968188 AA468196 AA468269 AA46B298 

AW970989 AA502167 AA507546 „ 

AW939074 AW939073 BE160476 AW939938 AW939206 AW940012 AW939076 AA573577 AW760479 AA574383 AW970067 
AA678512AA595535BE177533 

BE044740 AW827360 AW827623 BE161439 BE044718 BE046207 BE046551 AA663908 BE1665B1 

AI168278 AA868238 BE550792 A1522194 A1819707 AA97353B AI990086 AI628424 AI096270 AI991608 AA730741 

U23860U80739 

A1940425 



Pkey: Uniquerumberoorresponding loan Eos probesel 

Ref; Sequence source. The 7 digit numbers in tliis column are Genbank Identifer (Gl) numb 

sequence of human ctitomosome 22." Dunliam I. et al., Nature (1 999) 402:489-495. 
Strand: Indicates DI^IA strand from wliich exons were predated. 
NljosiUon: Indicates nucleotide positions of predicted exons. 



to the publication entitled Tiie DMA 



400834 
400840 
400850 
400881 
401016 
401090 
401215 
401241 
401336 
401381 
401400 



402049 
402085 
402241 
402294 



2642777 
8117441 
8492704 
9858408 
4827300 

8570226 
7708226 
6882292 
7249001 



9369520 
3873185 
7547159 
7770432 
8072512 
7249154 
7690131 



157683-153035 
121963-122288 
113882-114121 



30503-30844,31056-31248 
15736-16352 

118629-119146,119392-119657 
33028-33585 
125521-125639 
115142-117305 

139377-139674,141195-141281,142217-142340 



147273-147503 
136003-136726 
10964-11084,11674-11817 
148965-149396,149569-150002 



125073-125206,130998-131125 
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402366 
402551 
402654 
402685 
402762 
402797 
402833 
402901 
402948 
403066 
403072 
403146 
403197 
403214 
403217 



26987-27778 
176426-175667 

143456-143626,143808-143935 



162877-163118 
79990-80237 

76723-77027,79317-79484 
54089-54163,55427-55623 



95177-95435 
41565-41881 
125117-125287 



4241042544,83317-83540,86840-86922,87970-88110 



95602-95969 
113738-113858 
104545-104757 



109718-109847,109927-110202 



404418 
404427 
404429 
404440 
404495 



51086-51301 

153339-153481,155099-155294 
127170-127358 



117025-117170,118567-118736 

106184-106313 

4964449760 

70284-70518 



405727 
406760 
406779 

405944 
406002 
406016 
406097 
406104 
406156 
406207 
406300 
406308 
406314 
406317 
406432 



8272661 Plus 



138842-139061 
114896-116831 
78865-79664 
37424-38046 



35698-37269 
35309-35977 
379-697 



358408-358661 

12899-13011,18022-18136 

108018-108410 
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TABLE 9A: Genes prediclive of no bladder cancel progression 

Ptey. Unique Eos probesel idenllfler number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiO; Unigene number 

Unigene Tille:Umgene gene title 

R1 80th percenllle of Ta or T1 tumor Als frortr patients who did not upstage divided by the 80th percentile of Ta or T1 tumor Als from patients who did upstage 
R2 median of Ta or T1 tumor Als from patients who did not upstage divided by the median of Ta orT1 tumor Als t-om patients who upstaged 







UnigenelD 


Unigene Title 


R1 


R2 


408000 




Hs.198689 


bullous pemphigoid antigen 1 (230C40kD) 




5.88 




M21984 




troponin T3, skeletal, bst 


4.54 


5.10 




NM_001546 


Hs.34863 


inhibitor of DNA binding 4, dominant neg 


4.37 


1.63 








N M_0031 05*:Homo sapiens sortiiin-reiated 


3.69 


5.90 




AA244416 




gb;nc07d11.s1 NCLCGAP J^l Homo sapiens 


3.61 


2.03 




AW518944 


Hs.76325 


slep II splicing factor SLU7 


3.60 


1.00 




BE410132 




ESTs, Weakly similar to T17279 hypothetl 


3.58 


1.04 




AW1 89787 




ESTs 


3.57 


270 




H25836 


Hs.301527 


ESTs, Moderately similar to unknown |H,s 


3.56 


1.48 


421314 


BE440002 


Hs.ia0324 


Hcmo sapiens, clone 1I«WGE:41 8331 2, mlVIA, 


3.28 


3.25 




M68874 


Hs.211587 


phosphollpase A2, group IVA (cylosolic, 


3,25 


2,50 




U56979 
AL035668 




H factor 1 (complement) 
bone morphogenelic protein 2 


3.20 
3.20 


2.30 




AW964897 


Hs.290825 


ESTs 


3.20 


1.53 




A1249361 


Hs.74122 


oaspase 4, apoptosis-related cysteine pr 


3.00 


4.10 




AB014680 


Hs.8786 


carbohydrate (N-acetylglucosamlne-fi-O) s 


2.94 


278 




AW072003 


)^s.40968 


heparan sulfate (glucosamine) 3-C}«ulfol 


2.93 


1.43 


426283 


NM-003937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


2.93 


1.33 


428030 


Ai915228 


H5.11493 


Homo sapiens cDNA FU13536 fis, clone PL 


2.92 


2.47 




AW968068 


Hs.92381 


nudix (nucleoside diphosphate linked mol 


2.89 


3.33 


414407 


AA147026 


Hs.76704 


ESTs 


2.87 


2.87 




AW204145 


H5.156044 


ESTs 


m 


1.86 


411243 


AB039886 


Hs.69319 


CA11 


2.73 


1.00 


417878 


U90916 


1^.82845 


Hcmo sapiens cDNA: FU21930 fis, done H 


268 


2,32 


441619 


NM 014066 


Hs.7917 


DKFZP564K247 protein 


2.67 


298 


446619 


AU076643 




secreted pliosphoprolein 1 (osleoponlin. 


2.66 


1.00 




AK00O517 


Hs.6844 


hypothetical protein FU20510 


2.64 


1.77 


426252 


BE176980 


Hs.28917 


ESTs 


2.63 


7.30 


424008 


RC2740 


Hs.137555 


putative chemokine receptor GTP-binding 


2.60 


2.63 


429429 


AA829725 


Hs.334437 


hypothetical protein MGC4248 






427450 


AB014526 


Hs.178121 


KIAA0626 gene product 


2.57 


2.28 




AI004036 


Hs.25191 


ESTs 


2.56 




434061 


AW024973 


Hs.283675 


NPD009 prote'in 


2.54 


2,10 


422070 


AF149785 


Hs.11 1126 


pituitary tumor-transforming 1 interacti 


2.54 


3.25 


419366 


AA428620 


Hs.90061 


progesterone binding protein 


2.53 


3,63 


446215 


AW821329 


Hs.14368 


SH3 domain binding glutamfe acid-ifch pr 


2.52 




432442 


AI672516 




ESTs, Weakly simnartoS65657alpha-1C- 




4.60 


447887 


AA1 14050 


Hs.19949 


oaspase 8, apoptosis-related cysteine pr 




2,23 


401155 






Target Exon 


2.48 


2,30 


404530 






Target Exon 




1,00 


446006 


NM_004403 


Hs.13630 


deafness, autosomal dominant S 


2.48 


2,36 


436476 


AA326108 


Hs.33829 


bHLH prolein DEC2 




2,88 


446535 


AF257175 


Hs.15250 


peroxisomal D3,D2-enoyl-CoA Isomerase 




2.19 


408636 


BE2g4925 


Hs.46680 


CGI-12 prolein 


2.45 


1.60 














427008 


Z4S2S8 


Hs!286013 


short colled-coll protein 


2!42 


3^40 


459711 


BE386801 


Hs.21858 


trinucleotide repeat containing 3 


2.40 


278 


407910 


AA650274 


Hs.41296 


fibronectin leucine rich transmembrane p 


2.40 


1.00 


410337 


M83822 


Hs.62354 


cell division cycle 4-like 


2.39 




435029 


AF167706 


Hs.19280 


oysleine-rich motor neuron 1 


2J9 


3^23 


437181 
410968 


AI306615 
AA199907 


Hs.125343 
Hs.67397 


ESTs, Weakly similar to KIAA0758 protein 


2.39 
2.38 


1.00 
1.33 


422611 


AU076442 


Hs. 11 7938 


collagen, type XVII, alpha 1 
bone morphogenelic protein 2 


2.38 


6.40 


450775 


AA902384 


Hs.73853 


2.38 


2.71 


442433 


BE243044 


Hs.8309 


KIAA0747 protein 


2.37 


3.68 


454000 


AA040620 


Hs.5672 


hypolhetteal protein AF14022S 


2.36 


1.14 




BE619526 


Hs.255527 


hypothetical protein MGC14128 


236 


2.02 


427985 


A1770170 


Hs.29643 


Homo sapiens cDNA FU13103 fis, clone HT 


2.36 


2,18 


442257 


AW503831 


Hs.323370 


Human EST clone 26267 mariner (ransposon 


2.35 


3.53 


454070 


N79110 


Hs.21276 


collagen, type IV, alpha 3 (Goodpasture 


2.35 


3,60 


418452 




Hs.85201 


C-type (calcium dependent, oarbohydrate- 


2.35 


248 


421218 


NM_000499 


Hs.72912 


oytochrome P460, subfamily 1 (aromatic o 


Z35 


1.00 


407793 


AW080879 


Hs.236572 


gb:xc38g04j(1 NCI^CGAP Co20 Homo sapiens 


2.35 


1.21 




AA77'1284 


Hs.285728 


abl-interactor12 (SH3M:ontaining protel 


2.34 


3.03 


402846 






ENSP00000246267:KIAA0444 PROTEIN (FRAGME 


2.34 


1.62 


411407 


R00903 


Hs.169793 


ribosomal protein L32 


2.34 


0.77 


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39 


2.34 


0.72 
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424637 


NM_015057 


Hs.151411 


KIAA091 6 protein 


2.32 


2.66 




T64682 




gb:yc48b02.r1 Stralagene liver (937224) 




1.46 


411060 


m 006074 


Hs.318501 


Homo sapiens mRNA full lenglii Insert cDN 


2.32 




430028 


BE564110 


Hs.227750 


Target CAT 


2.32 


228 


417720 


AA205625 


Hs.20a067 


ESTs 


2.32 




436396 


AI683487 


Hs.152213 


wingiess-type MMTV integration site femi 






454219 


X75042 


Hs.44313 


v-rel avian reticuloendotheliosis viral 






444745 












408179 






poly(A)-specific ribonuclease (deadenyla 








R53718 


Hs.107882 


hypothetical protein FU 10659 


229 


279 


458949 


AW291777 




ESTs, Weal<ly similar lo T08599 probable 


2.28 


1.85 


407191 


AA608751 




gb:ae56h07.s1 Slratagene lung carcinoma 






448367 


A1966411 


Hs.94109 


Homo sapiens cDNA FU13634 (s, clone PL 


227 


l!l8 


405155 






Target Exon 


2.26 




445594 




Hs.12940 


zinc-fingers and homeoboxes 1 


2.26 


155 


417458 


NM_006655 


Hs.82173 


TGFB inducible early growth response 


225 


1.96 


430315 


MM 004293 


Hs.239147 


guanine deaminase 


224 


1.84 


411945 


AL033527 


Hs.92137 


v-myc avian nnyelocylomatosis viral oncog 


224 


273 


408937 


AA210734 


Hs.291386 


ESTs 


224 


3.18 




AL133990 


Hs.190642 


CEGP1 protein 


223 


1,00 


434094 


AA305599 


Hs.238205 


hypothetical protein PRO2013 
hypotheticd pntein FU103S2 




4.08 


420997 


AKC01214 


Hs.100914 




215 


420164 


AW339037 


Hs.24908 


ESTs 


222 


216 


414099 


U11313 ■ 


H5.75760 


sterol carrier protein 2 


221 




424800 


AL035588 


Hs,153203 


IVIyoDfanvly inhibitor 


2.21 


3.53 


459005 


AA447679 


Hs,144558 


ESTs, Weal<ly similar lo ALUI.HUMAN ALU S 






416290 


MM_000016 


Hs.79158 


acyl-Coenzyme A dehydrogenase, C-4 to C- 






439208 


AK000299 


Hs, 180952 


dynactin 4 (p62) 


220 


1.88 


401563 






C1 5001262:gil7304981 lreflNP_03a528.1| ca 


220 


1.77 


404687 






C9000375':gi|11994617|dbi|BAB02754.1|(A 


219 


260 


443303 


U67319 


Hs,9216 


caspase 7, apoplosis-relaled cysteine pr 


219 


2.08 


439866 


AA280717 


Hs.6727 


Ras-GTPase activating protein SH3 domain 


219 


2.21 


400835 


AW853954 




chromosome 2 open reading frame 2 


218 


3.00 




AF035528 


Hs.1 53863 


MAD (mothers against decapentaptegic, Dr 


218 


245 


431689 


AA3Q5688 


Hs.267695 


UDP-Gal:betaGlcNAc beta 1,3-gala:tosyltr 


217 


1.83 


418026 


BE379727 


Hs.83213 


tally acid binding protein 4, adipocyte 


217 


2.67 




AI267586 


Hs,268012 


falty-acid-Coenzyme A ligase, long-chain 


217 


1.71 










217 






AW880709 


Hs.283683 


chromosome 8 open reading Urame 4 


217 


100 




AA315805 






217 


1.81 




AW503990 


Hs. 142442 


HP1-BP74 


217 






MB9907 


Hs. 162292 


SWI/SNF related, matrix associated, adi 


217 


2.93 








ENSP00000252204*:Zinc finger protein 165 


217 






AA323750 


Hs.235026 


Homo sapiens, done lli/IAGE:4247S29, mRNA, 


216 


2]85 




AA311919 


HS.698S1 


nucleolar proleh farify A, msmbsr 1 (H/ 




360 




AW847318 


Hs.290131 


KIAA18ig protein 


216 


1.96 




AA312572 




phosphainositIde-3-)<inase, regulatory su 


216 


1.48 




BE207573 


Hs]83321 


neuromedin B 


216 


2.07 




A1075407 


Hs.296083 


ESTs, Moderately similar lo 154374 gene 


216 


232 




AI142379 




gb'.qg64c01.r1 Soares testis t^HTHomosap 


216 


1.85 




AI678033 


Hs.121476 


ESTs 


215 


1.43 


418827 


BE327311 


Hs.47166 


HT021 


216 


3.B4 


417470 


AF112219 


Hs.82193 


esterase D/formylgiutathione hydrolase 










Hs.101025 


basic transcription factor 3 


2.'l5 


1.26 




AW390822 


Hs.301628 


L-kynurenine/alpha-aminoadlpate aminotra 


215 


4.05 




AB029032 




KIAA1 109 protein 


215 


2.15 






Hs.3843 


Homo sapiens mRNA; cDNA OKFZp586F2224 (f 


215 


1.45 


424865 


AF011333 


Hs.1 53663 


lymphocyte antigen 75 












NM_003105*:Homo sapiens sortilin-related 


214 


267 


438916 


AW1 88464 






214 


2.38 




AI808780 


Hs.227730 


integrin,alpha6 


214 


2.00 


409345 


AI949109 




hypolheiical protein FU20783 


214 


1.40 


421939 


BE169631 


Hs.1 09727 


TAK1 -binding protein 2; KIAA0733 protein 


2.13 


1.58 


442315 


AA1 73992 


Hs.7956 


ESTs, Moderately similar lo2N91JHUMANZ 






419591 


AF090900 


Hs.91393 


Homo sapiens cDNA: FU21887 fis, clone H 


213 


2.00 


458026 


A1276406 


Hs.32450 


gb:ql63ciax1 Soares_NhHMPu_S1 Homosapi 


212 
212 


0.89 


428582 
422749 


W01076 


Hs.185065 
Hs.278573 


BENE protein 

CD59 antigen p18-20 (antigen Identified 


212 


273 


433091 


Y12642 


Hs.3186 


lymphocyte antigen 6 complex, locus D 


2.11 


0.91 


456421 


AL157485 


Hs.91973 


hypothetical protein 


211 


2.51 


421508 


NM_004833 


Hs.105115 


absent in melanoma 2 


211 


3.13 


402760 




NH<_02179r:Homo sapiens eosinophil chemo 




1.79 


406274 






Target Exon 


209 


1.60 
1.00 


406897 


M57417 




gb:Homo sapiens mucin (mucin) mRNA, part 


209 


409632 


W74001 


Hs.66279 


serine (or cysteine) proteinase inhlbito 


2.09 


2.92 


445320 


AA503887 


Hs.167011 


Homo sapiens cDNA: FU21362 fis, clone C 


209 


3.20 


442271 


AF000652 


Hs.8180 


syndecan binding protein (syntenln) 


2.09 


1,90 
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60 
65 



428336 


AA503115 


Hs.183752 


niiciosaniiiopfDieiii, beta- 


208 


405165 






ENSP00000238974*:Homeobox protein NKX2-3 


2.07 


416999 


AW195747 


Hs.21122 


hypolhelteal protein FU11830sralar to 


Z07 


453865 


AA307279 


Hs.35947 


melhyl-CpG binding domain protein 4 


Z07 


439924 


AI965B97 
AVV979062 


Hs.125293 


ESTs 

gb:EST391172 MAGE resequences, MAGP Homo 


2.07 
Z07 


407965 


BE536739 


H5.109909 


ESTs 


2.06 


412998 


BE046254 




gb:tin38g09.x2 Na_CGAP_RDF2 Homo sapiens 


206 


414013 


AA766605 


Hs.47099 


hypothetical protein FU21 21 2 


Z05 


415249 


R40515 


Hs.21248 


ESTs 


2.05 


427332 


R09418 




ESTs, Weakly similar to 138022 hypotheti 


205 


426521 


AF161445 


Hs. 1702 19 


hypolhelica) protein 


205 


431211 


MB6849 


Hs.323733 


gap junction protein, beta 2, 26I(D (conn 


205 


423851 


R39505 




Homo sapiens done 24566 raRNA sequence 


205 


4)0028 


AW576454 


Hs.346502 


ESTs 


204 


406575 






Target Exon 




457148 


AF091035 


Hs.184627 


KIAA01 18 protein 


2.04 


449924 


W30681 


Hs.146233 


Homo sapiens oDNA: FLJ22130 f s, done H 


2.04 


429837 


NM_003896 


Hs.225939 


sialyllransferase 9 (CMP-NeuAc;lactosylc 


204 




AW005054 


Hs.47883 


ESTs, Weakly similar to KCCI^HUMAN CALQ 


204 


411988 


AA455459 




ESTs, Weakly similar to T60609 hypotheti 


204 


433293 


AF007835 


Hs.32417 


hypolhetical protein MGC4309 




446187 


AK001241 


Hs. 14229 


hypolhetlcal protein FLJ10379 


204 


420838 


AW118210 


Hs.42321 


ESTs 




445481 


AW661846 


Hs.346630 


ESTs 


203 


448175 


BE296174 


HS.22S160 


hypothetical protein FU13102 


203 


410600 


AW575742 




ESTs, Moderately similar to S6S657 alpha 


202 


401177 






Target Exon 


202 


448474 


AI792014 


Hs.13809 


hypothetical pmtelnFUl 0648 


202 


434782 


NM.005032 


Hs.4114 


plastinSCrisofonn) 


202 


424125 


M31669 


Hs.1735 


inhlbin, beta B (actlvin AB beta polypep 


202 


424241 


AW995948 


Hs.182339 






424673 


AA346051 


Hs.294092 


ESTs, Weakly similar to 138022 hypotheti 


202 


414721 


X90392 


Hs.77091 


ribosomal protein LID 


Z02 


429869 


A1907018 


Hs.15977 


Target CAT 


202 


439177 


AW820275 


Hs.76611 


ESTs, Weakly similar to 138022 hypotheti 


Z01 


437175 


AW968078 


Hs.87773 


prolein kinase, cAMP-dependenl, calalyfi 


Z01 




AB018345 


Hs.27657 


KIAA0802 prolein 




417615 


BE548641 


Hs.82314 


tiypoxanthine phosphoribosyltransferase 1 


201 


420337 


AW295840 




Homo sapiens c DNA: FLJ21513 Js, clone C 


200 


408232 


AL1 37269 


Hs.43899 


Homo sapiens mRNA; cDNA DKFZp434C1714 (f 


200 


408409 


AW838181 


Hs.278337 


Homo sapiens oDNA FU1 1537 fls, clone HE 




433256 


AW604447 


Hs.33940a 


ESTs, Weakly similar to S26689 hypotheti 


200 


426969 


A1936504 


Hs.2083 


CDC-like kinase 1 




442053 


R35343 


Hs,24968 


Human DNA sequence from clone RP1-233G16 


2.00 




AB028956 


Hs.12144 


KIAA1033 prolein 


2.00 


452286 


AI368670 


Hs.1 23933 


ESTs, Weakly similar to ZN91 HUMAN ZINC 


2.00 


414906 


AA1 57911 


Hs.72200 


ESTs 


1.99 


414176 


BE1 40633 


Hs.75794 


endothelial differentiation, lysophospha 


1.99 


414557 


AA340111 


H6.100009 


acyl-Coenzyme A oxidase 1, palmltoyi 


1.99 


452346 


AA082160 


Hs.63368 


ESTs, Weal<lysIn<lartoTRHY_HUMAN TRICH 


1.99 


408437 


AW957744 


Hs.278469 


lacrimal proline rich protein 


1.98 


439205 


AF087990 


Hs.42758 


Homo sapiens, done 1MAGE3354845, mRNA, 


1.98 


442506 


BE566411 




ESTs 


1.98 


447731 


AA373527 


HS.1938S 


CGI-58 protein 


1.98 


410579 


AK00162B 


Hs.64691 


KiAA0483 protein 


1.97 


426716 


NM_006379 


Hs.171921 




1.97 


456141 


A1751357 


Hs.288741 


Homo sapiens cDNA: FU22256 fis, clone H 


1.97 


419576 


AK002060 


Hs.91251 


hypothetical prolein FU11198 


1.96 


407241 


M34516 




gb:Human omega light chain protein 14.1 


1.96 


420664 


A1681270 


Hs.99824 


BCE-1 protein 


1.96 


448586 


AF285120 


Hs.283734 


CGI-204 protein 


1.96 


4C8089 




Hs.42644 


Mredoxin-llke 






AW35T839 






195 


452518 


AA280722 


Hs.'24758 


ESTs, Weakly similar to 138022 hypotheti 


{95 


432015 
434263 


AL1 57504 
N34895 


Hs.159115 
Hs.44648 


Homo sapiens mRNA; cDNA DKFZp586O0724 (f 

ESTs 


1.94 
1.94 


4C9829 


M33552 


Hs.56729 


lymphocyte-specific protein 1 


1.94 


425593 


AA278921 


Hs.1 908 


proteoglycan 1, secretory granule 


1.94 


401835 






Target Exon 


1.94 


406557 






C5000893:gi|6226859|sp|P38525|EFG_THEIWA 


1.94 


440062 


AI350518 


Hs.1 29692 


ESTs 


1.94 


410442 


X73424 


Hs.63788 


propionyl Coenzyme A carboxylase, beta p 


1.94 


467281 


BE253012 


Hs.153400 


ESTs, Weakly similar to Ami_HUMAN ALU S 


1.94 


420230 


AL034344 


Hs.284186 


brkheadboxCI 


1,93 


452970 


NM_012238 


Hs.31176 


sirtuin (silent mating type Information 




403728 






Target Exon 





wo 03/003906 
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406759 
442073 
438023 
445502 
405474 
430007 
439937 
418068 
444630 
451184 
414715 
445841 
425284 
437943 
442426 
400111 
437762 
404069 



422506 
417439 
404391 



400634 
408466 
422366 
452170 



431129 
412843 
401512 



AW971155 

AI753230 
T87943 

AL080115 
AF1 55568 
NM_0I6353 
AI373062 



AW602154 

AK001714 
AA305800 

C19034 

T83882 

AF064801 

AV660637 

AA3796S8 

AW993680 

AL137751 

AF007555 

H11294 
AA533491 
M90464 



Hs.323806 
Hs.300741 
Hs.82143 



H3.288613 
Hs.97927 
HS.2B28S 
H3.247309 
Hs.272759 



gb:yi33l08j1 Scares placenta Nb2HP Homo 
ATP synthese.H transporting, mitodiondr 
RNA (guailne-7-) melliyllransferase 
FEM-1(C.eIegans)homologb 
DKFZP434J214 protein 
NM.001093*:HoiTiosa • 
KIAA1 116 protein 
CGl-148 protein 

ESTs, WesMy similar to ISHUSS protein d 
iiypoliietical protein DKFZp564K142 
transcription factor 7*8 2 (T-cell sp 
amylo-I.e^lucosldase, 4-alphafllucanolr 
DKFZP664G0222 protein 
NS1- 



Hs.31047 
Hs.23317 
Hs.169825 
Hs.97955 

Hs.85978 
Hs.273321 
13 

Hs.29724 
Hs.180378 



liypottietical protein MGC5370 
Eos Control 
synaptolagmin I 
Target Exon 

gb:EST386776 I^AGE resequences, MAGM Hom( 
gb:601 1 1 8231 F1 NIH_ll^GCJ7 Homo sapiens c 



E74-likB faclQr 2 (els domain transcript 
Target Exon 

ti/pottielical protein similar to ankyrin 
hypothelical protein AF14022S 
C10000818-;gi|7661882|reflNP.055697.11K 
Homo sapiens oDNA FU 14175 lis, clone NT 
ESTs 

patched related prolsin translocated in 
succinate-CoA ligase, GDP-fbrmIng, beta 
KIAA14S7 protein 

gb-KC3-BN0034-290200-013-d08 BN0034 Homo 
Homo sapiens mRNA; cDNA DKF2p434l0812 (f 

protein tyrosine pttosplialase, receptor t 
NM_014080:Homo sapiens dual oxldase-lil(e 
ESTs 

hypothetical protein FU14681 



442187 N23532 
4S828S AW296984 
413753 U17760 
428004 AA449563 
401613 
407173 
443146 



437374 

430677 
436749 
453016 



443963 
435479 
413073 
442473 
418060 



T64349 
AI049671 
AW976721 
A1.3S571 

AW502166 

Z26317 

AA584890 

AW295466 

AA393130 

AI417193 

R24601 



AA878183 
AF197137 
AL0381B5 



Hs.224961 Homo sapiens oDNA FLJ14415 «s, oione HE 

Hs.52515 Iransducin (bela)-like 2 

Hs.191326 ESTs, Weakly similar lo T18967 tiypotheli 

Hs.1 32955 Ba2/adenovirus El B 1 9kD-iiiterxting pro 

Hs.791 26 guanine micleoiide binding protein 1 0 

NM-003105':Homo sapiens sortirin-reiated 

Hs.288963 Homo sapiens cDNA: FU23034 its, clone L 

HS.255S9S ESTs, Weakly similar to A46302PrB4ssoc 

HS.75S17 laninin, beta 3 [nicein (IJSkD), kaiinin 

Hs.151393 glutamate-cysleine ligase, catalytic sub 

gb^W^^sl Stratagene king (937210) H 

Hs.307763 EST, WfeaWy simlla lo 138022 hypolhelic 

HS.2933Z7 ESTs 

Hs.44054 nin9in(GSK3B interacting protein) 



Hs.5302 lectin, galactosUe-blnding, solubis, 4 
Hs.232051 ESTs, Weakly similar to dJ403A15.3 tH.sa 
Hs.1 93894 ESTs, Weakly similar to A47582 icell gr 
Hs.2889 1 2 hypothetical protein FU22604 

CCR4-N0T transcription complex, subunlt 
Hs.72249 three-PDZ containing protein similar lo 
Hs.193616 B-cellCLUympliomalO 
Hs.17448 Homo sapiens cDNA FU13618«s, clone PL 
Hs.259737 ATP syntliase, H transporting, miloctiondr 
Hs,751g7 transkicase of outer mitachondrtal membr 

gb:43d9 Human retina cDNA randomly prfme 
H5.208047 ESTs 



201 



wo 03/003906 
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30 
35 



45 
50 









NM_003105*:Hoino sapiens sortilln-ielated 


1.77 


1.76 


400175 






Eos Control 


1.77 


2.04 


421501 
451234 


M29971 


Hb.1384 


0-6-methylgoanine-ONAmelliyllransferase 


1.77 


AI914901 

A1091466 


Hs.24052 
Hs.1 27241 


ESTs, Weakly similarlo 138022 hypolheli 

sorting nexin 7 


1.77 
1.76 




423960 


AA164516 


Hs.136309 


SH3-contalning protein SH3GLB1 


1.76 




450489 


AI697990 


Hs.346002 


ESTs 


1.76 




413076 


AB023212 


Hs.225967 


KIAA0995 protein 


1.76 




U10564 


Hs.75188 


weel (S. pombe) homolog 


1.75 


2.18 


421948 


L42583 


He.334309 


kerslin 6A 


1.75 


1.00 






Hs.81810 


ESTs 


1.75 


412430 


AW675064 


Hs.73875 


fumaiyiaceloacetale hydrolase (fumarylac 


1.75 




439396 


BE562958 


Hs.74346 


hypothetical protein MGC1 4363 


1.75 


178 


431448 




Hs.306201 


hypothetical protein DKFZp56401278 


1.75 


2.36 


449538 


A1559444 


Hs.104679 


ESTs 


1.75 


3.07 


453146 


A1338952 


Hs.32194 


ESTs 


1.74 


2,82 




m 006925 


Hs.166975 


splicing factor, arginine/serine-rich 5 


1.74 


2.8B 


408989 


AW361666 


Hs.49500 


KIAA0746 protein 


1.74 


2,07 


441715 


AI929453 




Homo sapiens cDNA FU13289 lis, clone OV 
spinocerebellar ataxia 1 (olivopontocere 


1.74 


2,06 


412718 


X79204 


Hs.74620 


1.74 


2.46 


450798 


AW167780 


Hs.5043a 


ESTs 


1.74 
1.73 


2,02 


445637 


AJ245671 


Hs.12844 


EGF-llhe^main, multiple 6 


2,58 


400190 






Eos Control 


1.73 


240 


416309 


R34694 


Hs.79194 


cAMP responsive element binding protein 


1.73 


410219 


T98226 


Hs.171952 


1.73 


2!75 


419814 


AW402478 


Hs.93213 


BCL2-antagonjst«lller 1 


1.73 


2.70 


448625 


AW970786 


Hs.178470 


hypothetical protein FU22662 


1.73 


2.07 


422387 


AA309996 


Hs.148656 


ESTs, Weakly similar to T12463 hypotheli 


1.73 


2.02 


417386 


AL037228 


Hs.82043 


D1 23 gene product 


1.73 


2.44 


405812 






Target Exon 


1.72 




436270 


C03769 


Hs.339669 


Home sapiens, done iMAGE:3947554, mRNA, 


1.72 


2M 


409855 


AW502461 




gb:UI-HF-BR0p-ajv-b-08-0-Ul.r1NIH_MGC_5 


1.72 




411442 




Hs.101810 


Home sapiens cDI^IA FLJ 14232 fis, ctane NT 


1.72 


1.88 


400846 
401660 
402190 






sortilin-related receptor, L(DLR class) 
Target Exon 

C19000835-:gij10946730|ref|NP_0673621j 


1.72 
1.72 
1.72 


1.63 
2.63 


439191 


AA281177 


Hs.41182 


Homo sapiens OC47 mRNA, complete cds 


1.71 


2!l7 


410444 
430393 




H3.132554 
Hs.241305 


gb:zd54e04.s 1 Scares Jelal_heart_NbHH19W 
estrogen-responsive B box protein 


1.71 
1.71 


2.70 


446066 


AI343931 


Hs. 149383 


ESTs 


1.71 


2!32 


411299 


BE409857 


Hs.69499 


hypothefcai protein 


1.71 


2.92 


408246 


N55669 


Hs,333823 


mitochondrial ribosomal protein L13 


1.71 


2.00 


454064 


A1336329 


Hs.301619 


Homo sapiens oDNA FU12536 fis, clone NT 


1.71 


1.93 


417381 


AF164U2 


H3.82042 


solute carrier family 23 (nucleobase lia 


1.70 


3.70 


427820 


BE222494 


Hs.180919 


Inhibitor of DNA binding 2, dominant neg 


1.70 


1.60 


400760 
456842 


BE145837 




9b:MR0+rr0208-101299-202-c07 Hnm Homo 


1.70 
1.70 


2.82 
2.17 


429966 


BE0ai342 


Hs.283037 


HSPC039 protein 


1.70 


1.18 


418444 


Alg02899 


H5.85155 


butyrale response fxtor 1 (EGF-response 


1.70 


2.47 


437450 


AL390154 


Hs.26954 


Homo sapiens mRNA; cDNA DKFZp762G123 (fr 


1.70 


3.03 


415738 


BE539367 


Hs.295953 


ESTs, Wealdy similar to AF22004g 1 uncha 


1.70 


2.34 


406245 






Target Exon 


1.70 


1.99 


408483 


AA464836 
BE153275 


Hs.291079 


ESTs, Weakly similar to T27173 hypolhetl 
gb:PI«0-HT0335-180400-008-e11 HT0335Homo 


1.70 
1.70 


2.05 
205 


410190 


AW072328 


Hs.69728 


Homo sapiens mRNA; cDNA DKFZp566C0646 (f 




2.20 


434608 


AA805443 


Hs.179909 


hypothetical protein FU22995 


1.69 


2.36 


432170 


T56887 
AF244137 


Hs.18282 
Hs.206g7 


KIAA1 134 protein 
host cell factor homolog 


1.69 


2^11 


436293 


A1601188 


Hs.120910 


ESTs 


169 


2.37 


448524 


AB03Z34S 


Hs.21356 


hypotWcal protein DKFZp762K2015 


1.68 


2.48 


404231 






Target Exon 


1.68 


250 


463906 


AW444952 


Hs.257054 


ESTs 


1.68 


2.45 


437957 


BE277414 


Hs.5g47 


mel transforming oncogene (derived from 


1.68 


1.00 








FAT tumor suppressor (Diosophiia) homcio 






448813 


AH 69802 


Hs!22142 


cytochrome b5 reductase b5R.2 


168 


169 


429162 


AK001250 


Hs.197642 


hypothetical protein FU10388 


1.68 


3.13 


425556 


H27225 




hypotheti'cal protein FU13114 


1.67 


202 


405630 






Target Exon 


1.67 


3.00 


421405 




Hs.1 04058 


CGl-29 protein 




3.25 


422640 


M37984 


Hs.1 18845 


troponin C, sbw 


167 


1.23 


450857 


AA629075 
Z43948 


Hs.190090 
Hs,326444 


ESTs 

cartilage acidic protein 1 


166 


2.48 
2.55 


433821 
405595 


AW182416 




ESTs 

NM_000721*:Homo sapiens calcium cliannel, 


1.66 
1,66 


265 


433892 


A1929357 


Hs.323966 


Homo sapiens clone H63 unknown mRNA 


1.88 


197 


443558 


AA376798 


Hs.286122 


MDS024 protein 


1.66 


200 


412141 


A1183838 


H5.48938 


hypothetical protein FU21802 


1.66 


265 



wo 03/003906 
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Hs.301804 
Hs.100469 
Hs.4890 
Hs.2719 
Hs.35089 
Hs.334437 
Hs.343812 



Hs.336898 
Hs.301637 
Hs.9436 
Hs.39733 
Hs.14637 
Hs.315111 
Hs.196384 
Hs.15 ' 



407874 
446700 
438184 
405502 
447050 
457961 
436774 



421205 
424012 
427016 



433187 
446825 
441166 
425571 
405836 
432374 
449268 
400772 



Hs.122125 ESTs 



Target Exon 

proline synlhelase (xHtansoribed (bacle 
KIAA1494 protein 

myeloid/lymphoid or mixed-lineage leukem 

ubiquilifrconjugaling enzyme E2E 3 (homo 

HE4; epididymis-spedflc, whey-acidic pr 

contaclin 3 (plasmacytoma associated) 

liypotlielical protein MGC4248 

lipoic acid synlhelase 

upstream binding protein 1 (LBP-la) 

KIAA0560 gene product 

pleci<slrin homology domain-containing, f 

CGI-1 10 protein 

ESTs, Wealily similar to S65657 alpha-IC- 
ras homolog gene laraily, member E 
Homo sapiens, Similar to RIKEN cDNA 1810 
zinc finger protein 258 

Homo sapiens, clone MGC:15763, mRNA, com 

postsynaptic protein CRIPT 

kidney- and llver-spedflc gene 

nudear receptor co-iepressor/HDAC3 comp 

ptoslaglandin-endoperoxide synthase 2 (p 

ESTs 

a disintegrin and metalloproleinase doma 
solute carrier family 30 (zinc transport 
NM_022041*:Homo sapiens giant axonal neu 
vacuolar protein sorting 35 (yeast homol 
CS000994*:gl|10435784|dbj|BAB14668.1| (A 
mitochondrlai ribosomal protein U 

glioblastoma amplified sequence 
protein tyrosine phosphatase, receptor t 
C5001831:gi|11056014MNP_067661.1|ac 
eukaryotic translation elongation (actor 
Homo sapiens cWA FLJ14059 fis, clone HE 
■ an DMA sequence from clone RP11-145L2 



Hs.270721 
Hs.159054 
Hs.77578 
Hs.9g601 

Hs.102541 
Hs.1 37569 
Hs.191579 



Hs.1 24623 
Hs.1 64557 
Hs.293381 
Hs.344097 



C7000609':gi|628012|pir||A53933 myosin 1 
STAW-like protein containing SH3 and ITA 
ESTs, Wealdy similar to 138022 hypothetl 
hypothetical protein FU13224 
ubiquilln speclHc protease 9, Xchromos 
hypothetical protein FLJ 12553 
Target Exon 
nelrln4 

tumor protein 63 kDa wilh strong homolog 
ESTs 

ESTs, Weakly similar to spliceosomal pro 
Homo sapiens oDNA FU12961 fis, cbne NT 
hydroxyprostaglandin dehydrogenase 15-(N 
ESTs, Weakly s:mliEr to AF2086E5 1 BM-01 
hypolhefal protein FLJ20507 



AW161450 Hs.109201 



ESTs 



immunoglobulin kappa conslant 
Homo sapiens cDNA FLJ11346 fi; 
hypothetical protein FLJ20160 
NMJ03105*;Homo sapiens 
UDP-Gal:betaGlcNAc beta 1,4- galaclosylt 
zinc finger protein 1 36 (clone pHZ-20) 
CGI-86 protein 

membrane cefaclor protein (CD46, Irophob 
hypothetical protein DKFZp56401278 
Homo sapiens, clone MGC:16169, mRNA, com 
cyclin D1 (Pi^DI: paralhyrokl adenonnalos 



dovm-regulator of transcription 1, TBP-b 
ESTs, Weakly sMarto alternatively sp 
ESTs, Moderately similar to 138022 hypol 



wo 03/003906 
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417343 


AA197132 


Hs.231581 


myosin, heavy polypeptide 1 , skelelal mu 


1.58 


2.84 


416643 


U62531 


Hs.79410 


solute carrier family 4, anion exchanger 


1.58 


1,26 


400847 






NM_003105*;Homo sapiens sorlilin-related 


1.58 


1.48 


436760 


AW606927 


Hs.5306 


hypothetical protein DKFZp5a6F1 1 22 simil 


1.57 


1,67 


433427 




Hs.171889 


cholinephosptiolransferase 1 


1.57 


1.64 


461986 


BE246996 


Hs.318401 


hypothetical protein DKFZp554D1378 


1.57 




428901 


AI929568 


Hs. 146668 


K1AA1253 protein 


1.57 


2.23 


426028 
444604 


NM_001110 


Hs.172028 


a disintegrin and melalloproteinase doma 


1.57 


3.07 


AW327696 


Hs,11441 


chromosome 1 open reading frame 8 


1.57 


1.86 


439685 


W40445 


Hs. 235857 


ESTs, Weal<ly similar to 138022 hypotheU 


1.57 


3.07 


426996 


AW968934 


Hs,173108 


Homo sapiens cDNA: aJ21897 fis, clone H 


1.57 


2.01 


447343 


AA256641 


Hs,236894 


ESTs, Highly similar to S02392 dpha-2-m 


1.57 


2.83 


418942 


AI566004 


Hs.141269 


Homo sapiens cDI^IA: FIJ21550 fis, clone C 


1.57 


1.21 


418555 


AI417216 


Hs,87169 


hypothetical protein FU12577 


1.56 


3.08 


402368 






NM_Q21155*;Honno sapiens CD209 antigen (C 


1.56 


2.05 


419749 


X7350a 


Hs.93029 


sparc/osleoneclin, cwcv and kazal-like d 




2.08 


404977 






Insulin-lte growth feolor2(somatomedi 


l!56 


5.50 


441872 


BE567100 


Hs.1 54938 


hypothetical protein MDS02S 


1.56 


230 


415503 


U36601 


Hs.78473 


1.56 


2.56 


4S1743 


AW074266 


Hs.23071 


ESTs 


1.56 


1.85 


423184 


NM_004428 


Hs.1624 


ephrlnnAI 


1.56 


1.41 


408041 


AW138782 


Hs.243607 


ESTs 


1.56 


2.21 


416777 


AF146760 


Hs.79844 


DKFZP664M1416 protein 


1.56 


2.00 


428013 


AF151020 


Hs.181444 


hypothetical protein 


1.56 


1.53 


410072 


BE384447 


Hs.16034 


hypothetical protein MGC13186 


1.55 


1.52 


411495 


AP000693 


Hs,70369 


KIAA0136 protein 


1,55 


2.88 


4D8162 


AAgg3833 


Hs.1 18527 




1.55 


2,70 


413350 


U02556 


Hs.75307 


t-complex-associated-testis-expressed 1- 


1.55 


1.99 


422010 


AA302049 


Hs.31181 


Homo sapiens cDNA: FU23230 fis, clone C 


1.55 


1,60 


425229 




Hs.1 5521 2 


methylmalonyl Coenzyme A mutase 


1.55 


2,57 


426184 


BE27828a 


Hs.155048 


Lutheran blood group (Aubergar b arligen 


1.55 


1.45 






Hs.a9552 


glutathione S-transferase A2 






417538 


AW050865 


Hs.275711 


hypothetical protein Iu1GC2452 


1.55 


2.78 


409806 


AW500960 




gb:UI-HF-BP0p-aiy-b-01-0-Ul.r1 MIH_I^GC_5 


1.55 


2.45 


402737 






Target Exon 


1.54 


2.58 


419825 


AI75401 1 


Hs.7326 


ESTs 






410001 


AB041036 


Hs.57771 


kallil<rein 1 1 


1.64 




407813 


AL1 20247 


Hs.40109 


KIAA0872 protein 


1.54 


2.33 


415906 


AI751357 




Homo sapiens cDI^IA: FU22256 fis, clone H 


1.54 


2.77 


427886 


AA417083 


Hs. 104789 


ESTs 


1.54 


2.60 


437018 


AA889078 


Hs.ia7033 


ESTs 


1.54 


2.48 


415049 


N67334 


Hs.50158 


ESTs 


1.54 


2,57 


422315 


016296 


Hs.116176 


T-oell lymphonna invasion and metastasis 




2,67 


413715 


AW851121 


Hs.75497 


Homo sapiens cDNA: FU221 39 fis, clone H 


1.64 


1,98 




AI630759 


Hs.17481 


Homo sapiens clone 24606 mRNA sequence 


1.54 
1.53 


2.48 


438924 
445166 


BE636511 
AI656116 


Hs.147451 


transmembrane trafficking protein 




2,06 


414073 


AF068293 


Hs.75737 


psilcentrlolar material 1 


1.53 


1,70 


402378 






Target Exon 


1.53 


2.83 


4S2316 


AA2S8484 


Hs.61265 


ESTs. Uoierm sMar to G7B6_HUMAN P 


1.53 


1,60 


460374 


AA397540 


Hs.60293 


Homo sapiens done 1 22482 unknown mRNA 




3.59 


402617 






C100355l:gil6678693lreflNP_033547.1|v»in 


1.53 


275 


406837 


R70292 


Hs.156110 


immunoglobulin kappa constant 


1.53 


1.01 


410573 


AF151057 


Hs.64595 


aminoadlpale-semialdehydedehydrogenase- 


1.53 


1.23 


426359 


AA376409 


Hs.10862 


Homo sapiens cDNA FU2331 3 fis, clone H 


1.53 


0,67 


434445 


A1349306 


Hs.1 1782 


ESTs 




2,80 


452717 


AW1 60399 


Hs.30376 


hypothetical protein 


1.53 


2,01 


420465 


AL080275 


Hs.70488 


similar to prol(ar/otic-type class 1 pept 


1.53 


225 


437404 


AA868974 


Hs.180992 


ESTs 


1.53 


200 


459192 


Awnsm 




gb,'RC2-BT0214-010999-001-E07BT0214Homo 


1.52 


3.20 


446467 


AI300580 


Hs.345281 


ESTs, Moderately sinllar to ALU1_HUMAN A 


1.52 


2,35 


441466 


AW673081 


Hs.54328 


ESTs 


1.62 


1,99 


421810 


AK001718 


Hs.108530 


hypothetical protein FLJ10856 


1.52 


2.98 


447769 


AW873704 


Hs.320831 


Homo sapiens oDNA FLJ14597 fis, clone NT 






414882 


D79994 


Hs.77546 


Homo sapiens cDNA; FLJ21983 fis, cloneH 


{52 


256 


442169 


W21813 


Hs.8125 


Homo sapiens mRI^A; cDI^A DKFZp586E1521 (f 


1.52 


1,31 


404349 






Target Exon 




274 


416278 


AA366366 


Hs.79137 


protein-L-isoaspartate (D-aspartate) Om 


l!52 


293 


431846 


BE019924 


Hs.271580 


uropiakin IB 


1.52 




431968 




Hs.2877 


cadherin 3, type 1, Pmadherin (placenta 


1.52 


0J3 


442670 


BE410060 


Hs.11859 


hypothetical protein FU 13188 




270 




M5S1S63 


Hs.1 78485 


Homo sapiens cDNA FU13919 lis, clone Y7 


1^52 


1.65 


440079 


A1567284 


Hs.6900 


ring finger protein 13 


1.52 


1.76 


432831 


AI821702 


Hs.1 15959 


ESTs, Weakly similar to 138022 hypolheti 


1.62 


213 


414320 


U13616 


Hs.75893 


ankyrin 3, node of Ranvler (ankyrln G) 


1.52 


213 


442149 


AB014660 


Hs.8118 


KIAA0B50 protein 


1.62 


1.00 


457747 


AW975Q00 




gb;EST387106 l*\GE resequences, MAGN Homo 


1.61 


238 



wo 03/003906 



PCT/LS02/21338 



419433 


AA814807 


Hs.7395 


hvpolhetical prolein FU231B2 


1.51 


Z50 


431812 


AA515902 


Hs.130650 


1.51 




415477 


NM002228 


Hs.78455 


v-jun avian sarcoma virus 17 oncogene lio 


1.51 


2.62 


447580 




Hs.1 33487 


ESTs 


1.51 








Hs.108920 


HT018 protein 


1.51 


2.22 


442755 


W57656 


Hs.109701 


Ubiquilin-like 5 


1.51 




448694 


AA478756 


Hs.194477 


E3 ubiquilin ligase SMURF2 


1.51 




422575 


BE018617 


Hs.119140 


eularyolic translation inilialion factor 


1.51 


1,49 


404397 






ENSP00000251675*:K1AA1550 protein (Fragm 


1.51 


2.18 


412927 


M284018 


Hs.75053 


human immunodeficiency virus type 1 enha 


1.61 


1.33 


402371 






Target Exon 


1.51 




431730 


AF208856 


Hs.268122 


hypothetical prolein 


1.51 


1.57 


417715 


AW969587 


Hs.86366 


ESTs 


1.51 


1.59 




AA016752 


Hs.205173 


ESTs 


1.50 


270 


434727 


H43374 


Hs.7890 


Homo sapiens mRNA for KIAA1671 prolein, 


1.50 


3.53 


442297 


NM_005202 


Hs.89901 


phospliodiesterase 4A, cAMP-specific [dun 


1.50 


2.24 


425883 




Hs.161031 


Homo sapiens mRNA; cDNA DKFZp434K0322 (f 


1.50 


1.13 


452658 


N88604 


Hs.30212 


thyroid receptor interacting protein 15 






428695 


AI355647 


Hs.1 89999 


purinergic receptor {family A group 5) 


1.60 


1.00 


438967 


H30340 


Hs.173706 


Homo sapiens cDNA: FU220S0 ts, done H 


1.50 


1.06 


419847 


AW390601 


Hs.184544 


Homo sapiens, clone 1MAGE:33S5383, mRNA, 


1.S0 


253 


431369 


BE184455 


Hs.251754 




1.49 


a97 


433266 


AB040971 


Hs.35096 


KIAA1538 prolein 


1.49 


1.44 


408136 


AL041135 


HS.429S9 


KIAAlai2proyn 


1.49 


2.24 


455485 


AA102287 


Hs.26756 


hypothetical protein FU20896 


1.49 


2.40 


418863 


AL136743 


Hs.25566 


ESTs, Weakly similar b 2004399A chromos 


1.49 


Z84 


405193 






C7000789:gil1943947|gb|AAC48716.1l(U901 


1.48 


2.20 


408948 


AW296713 


Hs.221441 


ESTs 


1.48 


2.20 


42S088 


AF038007 


Hs.166196 


ATPase, Class 1, type 8ft member 1 


1.48 


2.24 


405932 






C15000305:gi|3806122|gb|AAC69ig8.1| (AFO 


1.48 




454034 


NtA.000691 


Hs.575 


aldehyde dehydrogenaEe 3 family, member 


1.48 




422355 


AW403724 


Hs.300697 


coagulation factor VII (serum protombi 


1.48 




428044 


AA093322 


Hs.301404 


RNAbinding motif protein 3 




238 


416166 


AW501907 


Hs.261734 


Homo sapiens cDNA: FU22807 Us, done K 


1.48 


1.28 


430453 


BE387060 


Hs.3903 


Cdc42 effector protein 4; binder of Rho 


1.48 


2.73 


401600 


BE247275 




U5 snRNP-specific proteh, 1 16 kO 


1.48 


2.53 




A1017717 




chromosome 21 open reading frame 15 




2.03 


405194 






C7000789:gi|ig43947|gb|AAC48716.1| (U901 


1.48 


200 


416179 


R19015 


Hs,79067 


MAD (mothers against decapenlaplegic, Dr 




1.25 


450272 


A1075170 


Hs.20010 


ESTs 


1.48 


2.35 


413709 


BE158687 




gb:CI^0-HT0395-280100-169-b09HT0395Homo 




2.08 


442507 


AA507576 


Hs.288351 


Homo sapiens cDI>IA: FLJ22696 fis. done H 


1.48 


1.00 


410418 


D31382 


Hs.63325 




1.47 




436566 


BE545586 


Hs.278712 


Homo sapiens cDNA FU11074 fis, done PL 


1.47 


2.26 


404769 






NM_007037':Homo sapiens a dislnlegrin-li 


1.47 




420132 


BEff79847 


Hs.301914 


gb:RC6-BT0627-220300-012-DQ6 BT0627 Homo 




2.00 


448355 


AL120837 


Hs.20993 


high-glucose-regulated protein 8 


1.47 




421528 


ALt213)7 


Hs. 106210 


hypolheiical prolein F1J10813 




4.08 


449059 


AK000566 


Hs.98135 


hypothetical protein FU20559 


1.47 


ai3 


449029 


N28989 


Hs.228gi 


solute carrier family 7 (cationic amino 


1.47 


1.06 


422119 


A1277829 


HS.1118B2 


KIAAOS90 gene product 


1.47 


1.51 


438713 


H16902 




ESTs 


1.47 


239 


418248 


NM005000 


Hs.83916 


NtL0Q5000*:Homo sapiens NADH dehydrogena 


1.47 


1.00 


419125 


AA642452 


Hs.130881 


B-cell CLUIymphoma 1 1 A (zinc finger pro 


1.46 


2.20 


420548 


AA278246 


Hs.920 


ESTs 


1.46 


213 




AA433848 


Hs.107882 


hypothetical protein FLJ 10659 


1.46 


1.98 


414683 


S78296 


Hs.76888 


hypolhetical protein MGC12702 


1.46 


1.45 


427045 


H86504 


Hs.173328 


prolein phosphatase 2, regulatory subuni 


1.46 


231 


446646 


BE552004 


Hs.26192 


ESTs, WeakV similar to ALU1_HUMAN ALU S 


1.46 


1.30 


427257 


AI026805 


Hs.g7726 


ESTs 


1.46 


248 


422971 


AI879223 


Hs.145409 


RAB, member of RAS oncogene family-like 


1.46 


1.05 


451334 


AI12269f 


Hs.13268 


ESTs 


1.46 


212 


403326 






C2000428*:gi|77053a3|reflNP-057536.1|GC 


1.46 


2.40 




AF2Q1948 




BUP protein 








AI805664 


Hs.31731 


peroxiredoxin 5 


1.46 


1.56 




AW239226 


Hs,65450 




1.46 


1.49 


430688 




Hs. 104991 


hypothetical protein similar to preferen 


1.46 


245 


438083 


A1949940 


Hs,121924 


ESTs 


1.46 


2.00 


430713 


AA351647 


Hs.2642 


eukaryolic Inanslaton ebngalion factor 


1.45 




437325 


AF1 42481 


H5.6648 


f-box and leudne-rich repeat protein 6 


1.45 




403342 






Target Exon 


1.45 


221 


438808 




H5.129053 


Homo sapiens NOTCH 1 (N1) mRNA, complete 


1.45 


2.40 


446493 


AK001389 


Hs.16144 


hypothetical protein DKFZp564O043 




3.65 


414895 


AW894856 


Hs.116278 


HomosapienscDNAFLJ13571 fis, clone PL 


1.44 


271 


442072 


AI740832 




Homo sapiens clone 23570 mRNA sequence 


1.44 


1.08 


425723 


NM_014420 


Hs.159311 


dickkopf (Xanopus laevis) homolog 4 




2.24 


432901 


AI554929 


Hs.281866 


ATPase, H bansporling, lysosomal (vacuo 


{44 


1.63 



wo 03/003906 



PCT/LS02/21338 



412210 
421686 
428115 



413798 
410937 
400397 
405902 



AW901492 

AF189723 

AB023ig4 

BE567985 

L39874 

AA336708 

AA218564 

AJ270770 

AA620987 



429221 
422122 
406231 



gb:RC0-NN1012-27030tO31-h10 NN1012 Homo 
ATPase, Ca Iransporting, type 2C, member 
1 KIAA0977pcoletn 

ESTs, Moderately similar to ALU4.HUMAN A 
dCMPcfeamlnase 

capping protein (actin filament) musde 
vacuolar protein sorting 26 (yeast homol 
transcription factor 7-te 2 (T-oell sp 
Target Exon 



405376 






Target Exon 




436086 


Z43133 


Hs.9961 


Homo sapiens oDNA: FU21954 fis, clonB H 




418182 


AW016405 


Hs.16548 
Hs.23a944 


hypotfiefcal protein FLJ1fl631 


1.43 


434924 


AA443164 


Hs.23259 


tiypothelical protein FU13433 




417821 


BE245149 


Hs.82643 


protein tyrosine l<inase 9 
Target Exon 


1.43 
1.43 


405418 






Target Exon 
Target Exon 


1.43 


451608 


AA384525 


HS.2B746 


Itypotltetel protein 


1.43 


424099 


AF071202 


Hs.1 39336 


ATP-blndlng cassette, sub-family C (CFTR 


1.43 


401041 






C11000425.-glI4507721|rellNP_003310.1| tl 


1.43 


417839 


AI816732 


Hs.82712 


fragile X mental retardation, autosomal 


1.43 


409245 


AA361037 


Hs.288036 


tRNA isopenlenylpyrophospliate transferas 


1.43 


447808 


NM_007265 


Hs.19673 


suppressor of S, cerevlsiae gcr2 


1,43 


456492 


AA330047 


Hs.191187 


ESTs 


1.43 


449244 


AW859979 


Hs.32204 


ESTs 


1.42 


413094 


H24184 


Hs.25413 


TOUJP protein 


1.42 


452407 


AA682g09 


Hs.29353 


brain-specific protein p25 alpha 


1.42 


407674 


AW064061 


H5.279145 


ESTs 


1.42 


441297 




Hs.7766 


ubiquitin<on(ugaling enzyme E2E 1 (homo 




421932 
426348 


W51778 
BE466586 


H5.323949 
Hs.1 7433 


kangai 1 (suppression of tumorlgenidty 
tiypottietical prolein FU20967 


{42 


432554 


A1479813 


Hs.278411 


NCK-assooialed prolein 1 


1^42 


431736 


AW977724 


Hs.75968 


thymosin, beta 4, X chromosome 




429953 


HU 004376 


Hs.226581 


C0X15 (ysasl) homolog, cytochrome c oxid 


U2 


444037 


AV647686 


Hs.42733 


CHMP1.5 prolein 




402144 






Target Exon 


1.42 


456758 


AA325170 


Hs.224627 


ESTs, Weal<ly similar lo FAHUAA alpha-aci 




452322 




Hs.2a988 


glutaredoxin (thioltransferase) 


142 


426863 


AL137657 


Hs.1 72803 


liypotliclical protein lv1GC10327 


1.41 


410684 


AA088500 


Hs.1 70298 


ESTs 


1.41 


401784 






MM_0022ao':Homo sapiens l<eratin, tiair, a 




427523 


BE242779 


Hs.1 79526 


upregulated by 1,25-dihydroxyvitamin 0-3 


1.41 


449269 


AI564682 


Hs.175870 


ESTs 


1.41 


406467 






Target Exon 




444339 


T96555 


Hs.31562 


ESTs 


1.41 


431563 


AI027643 


Hs.120912 


ESTs 


1.41 


413343 


BE392026 


Hs.334346 


hypothetical protein MGC13045 


1.41 


447637 


AW295072 


Hs.346408 


ESTs, Weakly similar to AF193556 1 sacsi 


^'t 


428211 
406248 


AA424211 


Hs.183176 


ESTs 




437412 


BE069288 


Hs.34744 


HOTO s^iens mRNA; oDNA DKFZp547C13B (fr 


l'.41 


414653 
403885 


M24486 


Hs.76768 


procollagen-praline, 2-oxoglutarato 4-di 
Tai^Exon 


1.41 


439459 


AF086279 


Hs.58013 




U1 


419075 
405022 


T84266 


Hs.123927 


Target Exon 


1^40 


401346 


BE041451 




hypotiieHcal protein 


1.40 


415660 


AI909007 


Hs.78563 


ubiriuitln-conjugating enzyme E2G 1 (homo 


1.40 


448023 


AI693299 


Hs.1 70388 


ESTs 


1.40 




AA702820 


Hs.291294 


ESTs 


1.40 


432480 




Hs.275865 


ribosomal protein S18 


1.40 




AK000639 


Hs.75884 


DKFZP586A011 protein 


1.40 


440256 


U23841 


Hs.18851 


hypottiellcal prolein FU10876 


1.40 


413809 


L25851 


Hs.851 


integrin, alpha E (antigen CD103, human 


1.40 


408176 




Hs.43436 


adenylate kinase 3 alpha like 


1.40 


433960 


AW629188 




ESTs 


1.40 



BE011183 
A1B21060 
AA383642 



C6001430"fli|450352l|rBf|NP_OOl559.1|mu 
Target Exon 

gb:PM3-BN0218-10Q500-003Ki09 BN0218 Homo 
Target CAT 

lysosomal-associated protein transmembra 
Taigel Exon 
Target Exon 

gbDw23a1Dj(1 Soares_paratl)yfoklJumor_N 
14':Eoolropiovlws Integra 

206 



wo 03/003906 



PCT/LS02/21338 



453258 
448261 
427666 



430138 
432841 
444051 



400275 
403725 
443211 
421510 
430071 



447643 
401693 
403807 
406356 
401886 
421110 
427449 
427451 



BE244072 
A1791495 
AW992356 
BE315072 



N48373 
M69241 

AW139606 



AW978436 

NM_002709 
BE065047 
H10767 

N47812 



446044 
400967 
3 5 414506 



Hs .32597 

Hs.208l5 

Hs.180142 

Hs,8364 

Hs.78768 

Hs.234265 

Hs.62 

Hs.10247 

Hs.162 

Hs.221067 



Hs.143666 

HsAOSm 

Hs.23206a 

Hs.208647 

Hs.6763 

HS.134C16 

Hs.62515 

Hs.21537 
Hs.86412 
Hs.238465 



Hs.1356 

Hs.178112 

Hs.178137 

Hs.166547 

Hs.91390 

Hs.13572 

Hs.76293 



413691 
404906 
436246 
441478 
419715 
426261 
400129 
450447 
434697 
430308 
434767 
459729 



452065 
427979 



An9iai3 

AK002082 

H09548 

AK000574 

AA300067 

W61216 

AW004975 

AA318224 

NM.014072 

NM_000997 

AA031675 



AW450963 
AA350018 
AF070523 
M24283 



75 440512 
409865 
447390 



A1078645 

BE379776 

R16702 

BE297886 

BE242866 

AW811502 

AW500332 

AA887a45 



ring finger protein (C3H2C3 type) 6 
macrophage eiylliroblast allacher 
calmodulln-like skin protein (Cl^P) 
Homo sapiens pynivate deliydrogenase kina 
malignant cell expression-enhanced gene/ 
DKFZP686G011 protein 
protein tyrosine pliosplialase, non-recept 
activated leucocyte cell adiiesion molecu 



ngprote 



ESTs, WeaWy similar to ALU1 JtUliMN ALU S 
NIM.006513*:Homo sapiens seryl4RNA synth 
Target Exon 



ESTs 

K1AA0942 protein 
uronyl 2-sulfbtransferase 
KIAA0494 gene product 
ENSP00000067222*:Mil 
protein phosphatase 1, catalytic subunit 
chromosome 9 open reading frame 5 
nGAP-lilie protein 
Target Exon 



NI«_021783:Homo sapiens XEDAR (XEDAR), mR 
calhepstnE 

DNA segment, single copy prolie LNS-CAin. 



Hs.25063 
Hs.337445 
Hs.31917 
Hs.75478 

Hs.1 19991 
Hs.301342 
Hs.92384 
Hs.168383 

Hs.25010 
Hs.4084 
Hs.238990 

Hs.289848 

Hs.171695 
Hs.48924 

Hs.182217 

Hs.27721 

Hs.431 
Hs.181309 



poly (AOP-ribose) glycohydiolase 
calcium modulating Irgand 
Target Exon 
thymosin, tiela 10 

NM.021186'-.Homo sapiens zona pellucida g 
ESTs 

hypothetical protein FLJ11220 

ESTs, Weakly similar to 138022 hypotheti 

nitrogen Tixalion cluster-lite 

hypothetical protein □KFZp434N185 

epithelial V-like antigen 1 

MAD (mothers against decapentaplegic, Dr 

ESTs 

PRO0461 protein 
ribosomal protein L37 

IHomo sapiens, clone MGC:S658, rnRNA, comp 
ATPase, Class VI, type 116 
NI^_02S213:ttomo sapiens spectrin, beta, n 

ESTs 

hypothelical prolein IVIGC4342 
vitamin A responsive; cytoskelelor relat 
intercellular adhesion molecule 1 (CD54) 
Eos Control 

hypothelical protein P15-2 
KIAA102S protein 

cyciil^dependentkin»einhibllDr1B(p2 
C2H2 (Kruppel-type) zinc finger piolein 
EST, Weakly similar to ALU4_HUMAN ALU SU 
dual specifclty phosphatase 1 
KIAA0512 gene product; Al£!<2 
gb.UI-HF-BR0p-*a.b-05O4Jl.r1 NIHJ4GC_5 
Target Exon 

succinale-CoA ligase, ADP-forming, beta 
Target Exon 

Wolf-Hirschhom syndrome candidate 1-lii< 
G9000748:gi|8324209|gb|AAB34384.2|(S775 
murine leukemia viral (bml-1) oncogene h 
protsasome (prosome, macropain) subunit, 



HepA-related protein 
gb:QV2-ST0146-061299-01Hi04 ST0145 Homo 
hypothetical prolein dJ1181N3.1 
suppressor of S. cerevisiae gcr2 
gb.UI-HF-BR0p-q'ii-e49-O-Ul.r1 NIHJUGC.S 
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450293 
445831 
418610 
441946 
446192 
41S285 
425590 
407498 
441331 
411789 
420542 



410846 
443937 
432360 



401371 
405443 
453764 
424924 
453555 
404343 
412383 
404250 
413899 
422716 
448862 
409540 
431186 
402764 
420798 
459710 
435192 
401383 
453394 
421820 
444047 



447015 
414015 
414843 
424058 
401196 
450147 
422S99 
405172 
434087 
416720 
426521 
442685 
443879 
405180 
417365 
402087 



404273 
416204 
435076 
452497 



427195 
438129 
402138 



NU_000505 
AL359053 



BE045243 
M312264 
AA669735 



BE008180 
AL039103 
N23574 

AW947577 

AF083892 
AI702835 
AI35ig79 
AW40SS6g 
NM.012249 



Hs.223394 
Hs.120775 
Hs,14231 
Hs.48617 
Hs.158321 

Hs,149971 
Hs.72157 

Hs,1321 
Hs,75607 
Hs,57664 
Hs,77508 
Hs,106823 



AW960474 

AW662990 

AI097452 

R10482 

AA3B4109 

BE410992 

AB033029 

AA340987 

BE336038 

AL121616 



NM_001649 
AW511413 
BE268048 



AW972270 
AW298113 
AA732153 

AW1 60835 

W27230 

AA778647 



Hs.75508 
Hs.124475 
Hs.152717 



Hs.11110 
H5,171391 
Hs.8594 



Hs,24601 ESTs 



hypothetical protein FU0002B 
LanC (bacterial lanlWofc synthetase c 
hypothetical protein MGC2742 

selenoprotein W, 1 

Homo sapiens cDNA FU12540 lis, clone NT 
beaded fflamenl structural protein 2, pb 
gb;Human HMGl-C chimeric transcript mRNA 
ESTs, Moderately similar to ALUB.HUMAN ! 

coagulation factor XII (Hageman factor) 

myrlstoylated alanine-rich protein l<inas 
Homo sapiens mR|v|A full length insert cDN 
glutamate dehydrogenase 1 
hypothetical protein MGC14797 
■ ;l^[^-ST0062-031199-018-b03ST0062Homo 



Target CAT 

hypothetical protein MGC1 2966 
protein phosphatase 4 regulator/ subunit 
ENSP00000198192":BA43aF9.) (novel protei 
Target Exon 

Homo sapiens cDNA FLJ14353 fis, ctans Y7 
pumlllo (Drosophlla) homolog 1 
ESTs, Moderately similar to ALU7_HUMAN A 
C700219i ■.•gi|5053028(3l)(AAD3881 1. 1|AF155 
gb:RC0-MT0004-140300-031-b09 lvlT0Ca4 Homo 
Target Exon 

tight junction protein 2 (zona occludens 
ESTs, Weakly similar to YEF4_YEAST HYPOT 
hypothetical protein FU1372S 
gb:fh01e09.x1 NIHJ'GCJ 7 Homo sapiens cD 



Hs.1 35095 ESTs 
Hs.132a76 ESTs 
H5.177990 
Hs,258730 



Hs.278025 ESTs 



Hs.2391 
Hs.2780! 
H5,236494 

Hs.107528 

Hs.144054 
Hs.92909 
H5,27865 



prolyloarboxypeptidase [anglotenslnase C 

thyroid hormone receptor Interaclor 12 
Target Exon 
CGi-145 protein 

emsl sequence (mammar/ tumor and squamou 
Target Exon 

hypothetical protein PR01942 
hypothetical protein iyiGC2508 
C-termlnal binding protein 2 
KIAA1191 protein 

Homo sapiens mRNA; cDNA DKFZp564D156 (fr 
pho5phoinositide-3-l<inase, catalytic, ga 
transforming grotidh factor, beta tecepb 
Target Exon 

apical protein, Xenopus laevls-like 



RABID, member RAS oncogene family 
Eos Control 

androgen induced protein 

Target Exon 

ESTs 

SON DNA binding protein 

Homo saplenscDNA; FLJ21333 fis, clone C 

Target Exon 

hypolheiteal protein FU21620 
eukaryotic translation initiation factor 
gb:afB7d03£l Scares JesBs_NHT Homo sap 



NIlA.018936»:Homosai 
E3ubiqultinligasBSMURFl' 
Hs£5610 solute carrier family 30 (zinc transport 
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434585 
417219 
428125 
416188 

406621 
436663 
417260 
434978 



412824 
42Q037 
459221 



448466 
450098 
405053 
428915 
443721 
452047 
440213 
452900 
418721 



447966 
430316 
416272 
437456 



454555 
406872 
401720 
400082 
420133 
411579 
402191 
457118 
408576 



455476 
445926 
432647 
405435 
406140 
426201 



437352 
456535 
434202 
439528 
400178 
430023 
412841 



AW973473 
BE1 57260 



NM_002731 

AA373131 

R40815 

AA548906 

M340605 

NM_000875 

M178882 

AL047046 

HB8741 

NM.001406 

H91282 

AW807095 



Hs.184075 ESTs.WeaklysinnilailoALUIJHUMANALUS 

Hs.220936 ESTs 
Hs.182579 leucine aminopeptidase 
Hs.79070 
Hs.288316 
Hs.181125 immunoglobulin lamhdalociis 
Hs.5258 chromosome 1 1 open reading tranne2 
Hs.332115 ESTs 

Hs.4310 eukaryolic translation initiation faclor 

ttiyroid tiormone receptor-associated prot 

Hs.1608 insulin-degrading enzyme 

Hs.336948 Homo sapiens, clone llu1AGE:3530891 , mRNA, 

Hs.233550 zinc finger protein 208 

Hs.273629 ESTs 

gb;RC3-HT0600-060400-022-h10HT0600Homo 
Target Exon 

Hs. 11 2378 LIM and senescent cell aniigen-like donna 
Target Exon 



Hs.11261 
Hs.l3556g 
Hs.306121 
H3.104105 



smal prollne-rich protein 2A 
hypoMcal protein FU14708 
leukocyte receptor cluster (LRC) encoded 



Hs.66850 ESTs 



gb:MR4-BT0368-14O40Ofl06-g10 BT035B Homo 



Hs.87773 
Hs.24322 
Hs. 12396 



Hs.182469 
Hs.46423 
Hs.301636 
Hs.132569 
Hs.183868 



NM.00407g:Homo sapiens cathepsin S (CTSS 
ESTs 

hypothetical protein FIJ21080 
Target Exon 

SnQ-ielated CBP acllvalor protein 
ESTs 

ESTs, Weakly similar to BOX B BINDING FA 
succinate-CoAligase, GDP-fbrming, itpha 
piothymosin, elpha (gene sequence 28) 
protein kinase, cAMP-dependent, catalyti 
ATPase, H transporting, lysosomal (vacuo 
ESTs, Weakly similar to 20G43ggA diiomas 
ESTs 

ESTs, Weakly similar to Homolog of rat Z 
Insulin-like growth factor 1 receptor 
gb:zp38b09.rl Stratagene muscte 937209 H 
Homo sapiens done 1 22482 unknown mRNA 
ESTs 
ephHn-B3 

Homo salens cDNA: FU23190 lis, done L 
gb:MR4-ST0062.fl401 00-024-002 ST0062 Homo 
gb:wlD9h08j(l Na_CGAP_cai Homo sapiens 
NM.01458r:Homo sapiens SRY (sexdeterml 
EosCcnlTDl 
ESTs 

US snRNA-associated Sm-IIke protein LSm7 
NI\L021733':Homc sapiens tesUs-speclfc 
Homo sapiens mRNA; cDNA DKFZp564K1972 (f 
H4histone family, member G 
peroxisomal biogenesis factor 6 
PP2135 protein 



AW182614 
AI9272C8 
NM_005479 



BE391854 
AA282735 
AL353957 
AA305079 
BE382411 
BE613180 



gb;RCO-MT0013-280300-021-b06 MT0013 Homo 
splicing factor 3b, subunit 1, 155kD 
fibroblast growth factor receptor 2 (bac 

Target Exon 
ESTs 

matrix metalloproteinase 28 
frequently rearranged in advanced T-cell 
ESTs, Weakly similar lo MGB4_HUMAN MEl-AN 
hypothetical protein FU12577 
gb:601285394F1 NIH_MGC_4' 
AD031 protein 

hypothetical protein DKFZp434P0531 
cytochrome cc 



Homo sapiens cDNA: FU21314 fis, cli 
Eos Control 

FKMB-binding protein 2 (13kD) 
■ ■■ •■ • ■5inMGC11352 
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BE614551 


Hs.738 


ribosomal protein LI 4 


1.25 


1,22 


449S36 


AI65B808 
X73601 


Hs.281328 
HS.8490S 


ESTs, Weakly similar to T00378 K1AA0641 
cyloksratin 20 


1.25 
1.24 


aoo 

2.11 




Y00285 


H5.7B473 


insullrvlike growth factor 2 receptor 
ESTs 


1.24 


1.14 


436967 


AA761729 


Hs.136705 


1.24 


2.53 


457215 


AA452664 


Hs,283697 


ESTs, WeakI/ similar to A41796 neural re 


1.24 


218 


41 8414 


J04977 


Hs.84981 




1.24 


1.36 


423217 


NM_D00094 


Hs.1640 


colagen, type VII, alpha 1 (epidermolys 


1.24 


0.92 


421733 


All 19671 


Hs.1420 


fibroblast growth factor receptor 3 (ach 


1.24 


Z71 


402329 
447525 
445939 






NM.006505*:Homo sapiens pollovirus recep 


1.24 


1.13 


AF151031 
BE018668 


Hs.300631 
Hs.141003 


hypothetical protein 

Homo sapiens cDNA; FU21691 Us, done C 


1.24 
1,24 


223 


421936 


AB0408a4 


Hs. 109694 






2.16 


433681 






Homo sapiens oDNA FLJ13027 fis, clone NT 


1,24 


2,15 


426717 


N90977 


Hs.49690 


Homo sapiens mRNA; cDNA DKFZp434D2328 (f 






404751 


T70445 




ribosomal protein L9 


124 


1,30 


411456 






gb:IL3-CT0213-161299-038-G09 CT0213 Homo 


1.24 


235 


425417 


AF098948 


Hs. 1571 13 


coenzyme Q, 7 (rat, yeasi) homolog 
ESTs 


1.24 


2.88 


434508 


AI648601- 


Hs.118012 


1.24 


2.03 
1.59 


428284 




Hs. 183435 


NlvL004646:Homo sapiens l>IADH dehydrxjgenas 


1.24 


418597 


AK001678 


Hs.86337 


similar to DNA-dlrected RNA polymerase 1 


1.24 


2.27 




AW250089 


Hs.75807 


PDZ and LIM domain 1(eUn) 


1.24 


1,53 


449210 
439551 


AI635363 
W72062 


Hs.345517 
Hs.11112 


ESTs 
ESTs 


1.24 
1.24 


2.18 
2,13 


426244 


AI064808 


Hs. 168289 


succinate dehydrogenase complex, subunlt 


1,23 


1,06 


453635 


BE14a082 




MFH-amplified sequences wllh leudne-rte 


1,23 


1.34 


429617 




Hs.211563 


B-cell CLLJlymphoma7A 


1.23 


2,27 




AI929819 


Hs.92909 


chromosome 21 open reading frame 60 


1.23 


Z10 


417010 


NM 006225 


Hs.80776 


phospholipase C, delta 1 


1.23 


1.21 


426508 


W23184 


Hs.170171 


glutamale-ammonia ligase (glutaminesynt 


1.23 


1.37 


434055 


AF168712 


Hs.3726 


X 003 protein 


1.23 


1.58 


438363 


A1886361 


Hs.22353 


hypothetical prolelnFU21952 


1.23 


2.44 


450937 


R49131 


Hs.26267 


ATP-dependanI Interferon response prola 


1.23 


2.28 


407018 


U49869 




NM_01895S:Homo sapiens ubiqultin B (UBB) 


1.23 


1.44 


444981 


AW855398 


Hs.12210 


hypothetical protein FU13732 similar to 


1.23 


1.19 


440112 


AA099014 


Hs,231029 


Homo sapiens, done MGC15961, mRNA, com 


1.22 


207 


426672 
404956 


AW270556 


Hs.171774 


hypothetical prelein 

C1003210*:gl|6912582|ref|NP_03B524.1|pe 


1.22 
1.22 


2.18 


435088 


NMJ004ai 


Hs.102 


aminomelhyltransferase (glycine cleauage 


1.22 


1.08 




AW274454 


Hs.6318 


peroxisomal short-chain alcohol dehydrog 


1.22 


1.02 


434454 


AF217798 




LlSI-InleraclIng protein NUDEL; endoolig 


1.22 


1.27 


425689 


W16480 


Hs!24283 


ESTs, Moderately similar to reduced exp- 


1.22 


2.52 


428755 


087454 


Hs.192966 


KIAA0265 protein 


1.22 




420685 


AA279362 




gb:z584d04.rl NCI CGAP_GCB1 Homo sapiens 


1.22 


2,75 




A1743502 




gb:wf63h12.x2 Soares.NFL.T_GBC.S1 Homos 


1.22 


2.39 




X06370 


Hs.77432 


epidermal growth factor receptor (avian 


1.22 


1.00 


434023 


A1277883 
AI580935 


H5.146141 
Hs.105698 


ESTs 
ESTs 


1.22 
1.22 


2.12 


414880 


AW247306 


Hs.1 19140 


eul<aryolic translation initialion feolor 


1.21 


1.16 


454144 


BE280478 


Hs.182695 


hypothetical protein MGC3243 


1.21 


1,04 


404790 






C12001707':gl|7305215|ref|NPJ38599.1l l< 


1.21 


2.05 


403943 
400201 






C5000355;gi|4503225|reflNP_000765.1| oyt 
NM_006156*:Homo sapiens neural precursor 


1.21 
1.21 


2,05 
1,35 


421005 
400789 


AW293089 
X02419 


Hs.33263 

Hs.77274 


ESTs 

plasminogen activator, urokinase 

C1 1001 367*;gi|1C762051pirllS50754 hypolh 


1.21 
1.21 
1.21 


2.02 
1.06 


412853 


M34175 


Hs.74626 


adaptor-related protein complex 2, beta 




1.24 


449709 


BE410592 


Hs.23918 


hypothetical protein PP5395 


1.20 


1.27 


413726 


AJ278465 


Hs.75510 


annexin All 


1.20 




428485 


NM_002950 




ribophorin 1 




U4 


405163 






C5000561*:gl|7513700|pirllT14151 Inv pro 






415887 


NM 003375 


Hs.78902 


voltage-dependent anion channel 2 


1.20 


1.16 


434468 


N2g309 


Hs.39288 


ESTs 


1.20 


3.20 




AW1 35925 




hypothetical protein MGC11332 






432642 


BE297535 
R6C645 


Hs!3069 
Hs.20768 


heat shock 70l(D protein 9B (mortalin-2) 
HSPC189 protein 


120 


2A8 
1.22 


415753 


U62819 


Hs.78781 


vascular endothelial grov<th factor B 


r20 


1,05 


442156 


AI690586 


Hs.29403 


hypothetical protein FLJ22060 


1.20 


1.18 


408824 


L8D005 


Hs.48375 


small nuclear ribcnucleoproteln polypept 


1.20 


1.45 


430012 


NM_016373 


Hs.227637 


chromosome 22 open reading frame 2 


1.20 


1.03 


413392 


AW021404 


Hs. 13021 


ESTs 


1.20 




401286 






Target Exon 


1.20 


108 


415665 




Hs.274430 


surfeit 6 


1.20 


2.53 


456562 


aISw 


Hs.102669 


DKFZP43401 25 protein 


1.20 


2.40 


408988 


AL119844 


Hs.49476 


Homo sapiens ckine TtJA8 Cri-du-chat regl 


1.20 


3.45 


427818 


AW511222 


Hs.1 93765 


ESTs 


1.19 


2.99 


406404 






NM_002182*:Homo sapiens Inlercelliilar ad 


1.19 


1.17 
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25 
30 
35 
40 
45 
50 



400124 
415023 
427751 



406181 
422559 
409524 
438446 
447980 
425503 
411469 



407970 
440214 
405646 
438433 
431427 



405058 
446623 
419754 



426295 
448323 
414244 
442872 
425318 
415667 
401058 



405166 
401038 

414052 



AF000152 
AA128965 



AW247696 
AW402151 
AW137476 



AA973904 
AW296385 
AI732617 



447495 
428422 
421782 
405855 
428972 
406761 
432425 

424454 
418242 
437407 
447459 
426682 
403655 
433156 
403826 



AI970399 
AW401864 
A1557280 
AA297546 

AK001470 
AI241715 
AF070619 



AV660038 
R59206 



ris.180669 conseived gene ampllfl 



Hs.155839 
Hs.54673 
Hs.135204 
Hs.202355 
Hs.1 58203 



■■•Stenjl regulatory elema 
TATA box binding protein [TBPj-associate 
ppolefn lyrasine phospliatsBe, non-recepi 
Eos Control 

CX001 21 2*:gi|7861 932|gb|AAF70445.1| (AR 
Target Exon 

liypollietical protein l«GC12934 
tumor necrosis factor (ligand) superfami 
Homo sapiens cDNA FU138B4 lis, clone TH 

ESTs 



ns.imn cysleine-ricli protein 2 
Hs.50868 solute carrier family 22 (organic cation 
Hs.227277 sine ooulis liomeobox (IDrosopiiila) tiomol 
, 1 (influences h'l_A c 



Hs.31861 
Hs.180711 
Hs.147916 
Hs.198272 
Hs.41714 
Hs.7049 

Hs.50651 
Hs.252748 
Hs.48689 

Hs.176092 



Hs.182362 ESTs 



ESTs 

Homo sapiens, Similar to hypotheiical pr 
DEAD/H (Asp^lu-Ala-AspflHIs) box polypep 
Target CAT 

BCL2-assoclated athanogene 
hypothetical protein FLJ11305 
C12000200:gi|4557225|reqNP_000005.1|al 
Janus kinase 1 (a protein tyrosine kinas 
Homo sapiens cDNA FLJ20394 fis. clone KA 
ESTs 



AW673847 

M639e7 

I^IMJ15958 

AW367283 

AI492298 

AA287801 

AI471987 

AU076845 

F11582 



A1130740 
AI076192 



Hs.169517 

Hs.278270 

HsJ171l 
Hs.173046 
Hs.155596 
Hs.78582 

Hs.6241 
Hs.131933 



AW578849 Hs.2B3552 

BE567620 Hs.99210 

AW1 50796 Hs,92700 

NM_002153 Hs.155109 
AL021154 



aldehyde dehydrogenase 1 family, member 
CGI-30 protein 

zinc finger protein 6 (CMPX1) 
ESTs 

ESTs, Moderately similar to Z1 95_HUMAN Z 
ESTs 

BCLaadenovirus E1B 19kD-interacting pro 
developmentally regulated GTP-blnding pr 
Target Exon 

8b:UI+IF-BP0p-aiw-e-10-0-Ul.r1 NIH_MGC_5 
phosphoinosltide-3-kinase, regulatory su 
C7001460:gl|12667420|gb|AAK01436.11AF332 
ESTs 

Target Exon 

C11000426:gil4507721|ref|l>IP_003310,1ltl 
ESTs, Weakly similar to unnamed protein 



Hs.240079 
Hs,18720 
Hs.184270 



Hs.1 94692 
Hs.77039 
Hs.274539 



Hs.1 47946 
Hs.88414 
Hs.1 29031 
Hs.159424 
Hs^6 

Hs.17519 



C1003562*:gi|l0047l77|dbj|BAB13382.1|(A 
mitogen-aclivated protein Idnase kinase 
Target Exon 
ESTs 

programmed cell death 8 (apoptosis-induc 
capping protein (actiii iilamenl) muscle 
gb;EST1 13074 Fetal brain III Homo sapien 
Target Exon 
cysteine desulfurase 

ATP synthase, H transporting, mitochondr 
Homo sapiens clone 24481 mRNA sequence 
chromosome 14 open Heading frame 4 
opSc atrophy 1 (autosomal dominant) 
BTB and CNC honnology 1, basic leucine zi 
ESTS 
ESTs 

UDP glycosyltransferaae 1 family, polype 
NM_003071:Homosapiens SWIffiNF related. 
Homo sapiens cDNA: FU22539 (is, clone H 
Target Exon 



211 
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Aid 6521 


Hs.71816 


v-akl murine thymoma viral oncogene liomo 


1.09 


1.06 




H86180 


Hs.221513 




1.08 


2.75 




AW815123 




gb:QV4-ST021 2-261 199^)45-b01 ST0212Homo 


1.08 


1.98 




BE304678 


Hs. 119598 


ribosomal protein U 


1.08 






A J 275986 




transcription factor (SMIF gene) 


1.08 


1.00 




AB011128 


Hs.30512 


Homo sapiens mRNA forKIAA0556 protein, 


1.08 


2.45 








matrix metalloproteinase 7 (matrilysin, 


1.08 


215 






gb;tf05c12.x1 NC)_CGAP_Pr23 Homo sapiens 


1.08 






BE564871 


Hs.29463 


centrin, EF-liand protein, 3 (CDC31 yeasi 




2.03 








gb:Hk763-fAdylt heart, Clontech Homo sa 




2.08 




NM_CCC591 


Hs.75627 


CD14 antigen 




0.91 




AL1 57545 




bromodomain and PHD linger containing, 3 




1.00 




AW952124 








1.11 








cyloclirome P450, subfamily HE (elhand- 




2.14 




AW937733 




gb;QV3-DT0045-21010a063^l06 DT0045 Homo 


1.06 


2.03 




AI792660 




KIAA0771 protein 


1.05 


227 




AW293880 


Hs.248367 


lulEGFII protein 




2.00 




AI908006 


Hs.295362 


Homo sapiens cDNA FIJ14459 lis, clone HE 


1.04 


233 


404058 






Target Exon 








AW954243 




KIAA0251 protein 




2.18 








gb:yr1 6f04.r1 Soares fetal liver spleen 








AW410189 


Hs.98074 


itchy (mouse homolog) E3 ubiquitin prole 






440439 


N928ia 




ESTs, Weakly similar to potential CDS [H 


1.04 


2.05 


409209 


AA450160 






1.04 


2.73 




AA1 60000 


Hs. 137398 


ESTs, Weakly similar to JC5238 galactosy 


1.03 




415403 


F07923 


Hs.267<M 


esT-s 


1.02 


2.43 


455591 


BE008018 




gb:QV0-BN0147-2g0400-214-c01 BN0147Homo 


1.02 


Z08 


428491 


AF091035 


Hs. 184627 


KIAA0118 protein 


1.02 


2.81 




BE514982 


Hs.38991 


SlOOcaldurvblndIng protein A2 


1.02 


111 


434540 


NM_016045 


Hs.3945 


CGI-107 protein 


1.02 


2.78 




AI69C080 


Hs.1 28907 


ESTs, Weakly similar b ARIX homeodomain 








AW246428 


Hs.75355 


ubiquitin-conjugating enzyme E2N (homolo 






452436 


BE077546 


Hs.31447 


ESTs, Moderately slmilartgA46010X-lln 


1.02 






AW582568 




gb;RC1-STO278.<)80100-O11-h04 ST0278 Homo 






426955 


AA393669 


Hs.238094 










AA830797 


Hs. 184760 


CCAAT-box-bindIng transcripUon factor 












CX000682:gl|12741327|ref|XP_008833.2|zi 






400296 


AA305627 


Hs.1 39336 


ATP-Wndlng cassette, sub-family C (CFTR 






402001 






Target Exon 






402812 






NM_004930*:Homo sapiens capping protein 






402892 






Target Exon 






403329 






Target Exon 


1.00 


1.00 


407202 


N58172 


Hs.1 09370 


ESTs 






408684 


R61377 




hypothetical protein FU21610 






410555 


U92649 




a disintegrin and metalloproteinase doma 






413573 


AI733859 


Hs.149089 


ESTs 








AL036166 










414422 


AA147224 










417006 


AW673606 


Hs.80758 


aspartyl-tRNA synthetase 








BE465451 


Hs.1 05925 


single-minded (Drosophila) homolog 1 






423349 


AF010258 


Hs.1 27428 








424273 


W40460 


H5.1 44442 


phospholipase A2, group X 






424649 


BE242035 


Hs.151461 


embryonic ectoderm development 






426827 


AW067806 


Hs.172665 








427308 


D26067 


Hs.174905 


KIAA0033 protein 






429597 


NM_003816 


Hs.2442 








430261 


AA3C6127 


H5.237225 


hypothetical protein Hr023 


1.00 




431078 


U82827 


Hs.249195 


homeoboxA13 






433222 


AW514472 


Hs.238415 


dickkopf (Xenopus laevls) homolog 4 


1.00 




434980 


AW770553 


Hs.14553 


sterol 0-aoyltransferase (acyl-Coanzyme 








U29690 


Hs.37744 


Homo sapiens beta-1 adrenergic receptor 


1.00 


1.00 


443054 


A1745185 


Hs.8939 


yes-assoclaled protein 65 kDa 






443564 


A1921685 


Hs.199713 


ESTs 


1.00 


1.00 














445413 


AA151342 


Hs!l2677 


CGI-147 protein 


i!oo 


i!oo 


448708 


AW291095 


Hs.21814 


interleukin 20 receptor, alpha 


1.00 


1.00 


448807 


AI571940 


Hs.7549 


ESTs 


1.00 


1.00 


449448 


D60730 


Hs.57471 


ESTs 


1.00 


1.00 


449517 


AW500106 


Hs.23643 


serine/threonine protein kinase MASK 




1.00 


450568 


AL050078 


Hs.25159 


Homo sapiens cDNA FLJ10784 fis, clone NT 


i]oo 


1.00 


451844 


T61430 




gb:yc06a03.s1 Stratagene lung (937210) H 


1.00 


1.00 


452039 




Hs.172510 


ESTs 


1.00 


1.00 


452795 


AW392555 


Hs.1 8878 


hypothetical protein FU21620 


1.00 


1.00 


463096 


AW294631 


Hs.11 325 


ESTs 


1.00 


1.00 


453370 


AI470523 


H5.139336 


ATP-bindIng cassette, sub-family C (CFFR 


1.00 


1.00 


453966 


BE148734 


Hs.63325 


transmembrane protease, serine 4 


1.00 


1.00 


405580 






Target Exon 


1.00 


1.00 



wo 03/003906 



PCT/LS02/21338 



430268 


AK000737 


HS.23748Q 


hypothetical protein FU20730 


1.00 


1.00 


450377 




Hs.74313 


KIAA1265 protein 


1.00 


1,00 


433226 


AW503733 


Hs.9414 


KIAA1488 protein 








AW016610 


Hs.816 


ESTs 


too 


too 


425289 




HS.155S30 


interferon, gamma-indudble protein 16 


1.00 


1.00 


446921 


AB012113 


Hs. 16530 


small inducll)le cytokine subfamily A (Cy 




1.00 


439706 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP1 HUMAN DEATH 




1.00 


433817 




Hs. 163242 


ESTs 




1.00 


455474 


AW948094 




gb:RCO-MT0012-290300-03lK}10 II/IT0012 Homo 


0^99 


2.00 


420148 




Hs.95361 


myosin VIIA (Usher syndrome IB (aukBoma 


0.99 




428466 


AF151063 


Hs.1 84456 


hypothetical protein 


0.98 


2.20 


421594 


R45689 




Homo sapiens cDNA FLJ12978 fis, done NT 




3.09 


444654 


AV650572 




K1AA1106 ptolein 


0^98 




409759 




Hs.81 182 


histamine N-raethyltransferase 


0.97 


2^20 


401936 






TaiBBl Exon 


0.97 


2.39 








Target Exon 


0.96 


2.58 


434421 


A1915927 


Hs,34771 


ESTs 


0.96 


2.15 


412636 


NM_004415 




desmoplakin (DPI, DPIl) 


0.95 


2.01 




BE093589 


Hs,38178 


hypothetical protein FU23468 


0.93 


2.55 


421938 


AA405951 




gb:zu66c01.r1 Soares testis NHT Homo sap 


0.93 


3.10 


447470 


BE618324 


Hs.263561 


ESTs, Weakly similar to A53531 oncofetal 


0.92 


2.08 


448369 


AW268g62 


Hs,1 11335 


ESTs 


0.91 


2.35 


421710 


AB007930 


Hs,107088 


KIAA0461 protein 


0.91 


2.63 


406805 


AI686003 


Hs.296031 


ESTs 


0.91 


2.21 


447475 


AI380797 


Hs.158gg2 


ESTs 


0.90 


3.25 


428892 


U82828 


Hs,194382 


ataxia telangiectasia mutated (includes 


0,90 


2.02 


450222 


U75308 


Hs,24644 


TATA box binding protein (TBP)-associ3te 




2.73 


401572 






C15001384*;gi|12737057|rellXP_012129.1| 


a88 


2.00 


429226 


AA913330 


Hs,53542 


choreoacanthocytosis gene; KIAA0986 prot 




2.37 


421979 


AW062518 


Hs.233150 


liypothetical protein UGC5560 


0^87 


3.70 


407614 


NM_001932 


Hs,37144 


membrane protein, palmitoylaled 3 (MAGUK 




2.02 


417912 




Hs,50547 


ESTs 


o!86 


2.00 


401654 






NluL007242:Homosapiens DEAD/H (Asp-Qu-A 


0.86 


2.11 


403149 






NM.001 450:Homo sapiens four and a half L 


0.86 


2.19 


413000 


BE046280 




gb:hn43c09.x2 NCLCGAP_FiDF2 Homo sap'iens 


0,85 


2.40 


425166 


AK001456 


Hs.1 54919 


KIAA0625 protein 


0.85 


2.18 




AA334274 


Hs.18368 


DKFZP564B0769 protein 


0.84 


2.13 


452801 


A1935587 




ESTs 


0,84 


2,55 


400957 






Target Exon 






426420 


BE383808 




NDRG family, member 4 


0.83 




429354 


AA451566 


H3.269363 


ESTs 


0.80 


2.25 








CD47 antigen (Rh-related antigen, Integr 






443368 


BE568891 


Hs!l 99210 


ESTs, Moderately similar to bK116F5.2lH 


0.78 


2.00 


441901 


AI914445 


Hs.1 28103 


ESTs 


0.76 


2.06 


429462 


A1890356 


Hs,127804 


Homo sapiens, done IMAGE;3536432, mRNA, 


0.76 


2.03 


403010 






C21000152:gi|6226483|sp|Q52118|YMO3.ERWS 






420344 


BE463721 


Hs.97101 


potatlve G protein^iupled receptor 


o!75 


2,42 


448332 


AW293110 


Hs.171068 


ESTs 


0.74 


2.00 


421674 


T10707 


H6,296355 


hypothetical protein FU23138 


0.71 


2.18 


450645 


AL117441 


Hs.301997 


hypothetical protein FU13033 


0.67 


2.06 


448514 


AB020626 


Hs.301866 




0.67 


2.03 


431609 


AW792792 


Hs.264330 




0,60 


2.54 


417512 


X76534 


Hs.82226 


glycoprotein (transmembrane) nmb 


0.60 


2.00 


425960 


AW410646 


Hs.164649 


hypothetical protein DKFZp434H247 


0,57 


Z15 


408077 


AL133574 


Hs.42458 


Homo sapiens niRN^ cDNA DKFZp586C1817 (f 


0.53 


2.29 



TABLE 9B 

Pkey: Unique Eos probeset Identifier number 
CAT number: Gene duster number 
Accession: Genbank accession numbers 



409540 
409806 
409838 
409844 

409865 
410600 
410846 
410946 
411456 



121108.1 
1223902 1 
1227589.1 
1246706.1 



AI949109 AW1 1 8631 AI833148 AW1 17891 R4S932 Ra0970 AA7331 10 AW26942a T27024 R97693 AI149202 AI475492 H98846 AA809665 
Al125598AW663742Al140029AW103164AA1268e2N66829W65398 H94509 W65300 R07712 R36955AA812477AA609576Al754304 
Z41 076 AA97O0O4 AW274661 AA923584 AI673108 AA070706 AA541812 T9093a 

AW409569 BE297044 BE295828 
AW600960 AW501 280 AW50081 4 
AW502928 AW505606 AW501576 AW501577 
AW502336 AW502339 AW501736 AW501839 
AW502461 AW503000 AW502207 AW501862 



AW575742 BE549623 AI335824 BE463447 AA729043 AW40871 2 AW49961 6 AA086179 AW499617 AA1 91 322 

mmosr AW807054 AW8071 89 AW807193 AW807359 AW807429 AW807364 AW807365 AW807078 AW807256 AWaO7180 AWa07331 

AW811502AWB11521 AW81 1648 AW81 1471 AW811511 AW811508 

AW84758B AW84771 6 AW847664 AW847592 



213 



wo U3/003906 



PCT/US02/21338 



411609 1251530.1 

412210 12B3615J 

412383 1292509J 

412560 130601.1 



412636 13165J 



413000 
413611 
413709 



13B0017J 

1390710J 
1397743.1 



416272 158407.1 
417500 168443.1 
419556 If""' ■ 



422895 22276.1 
422984 223488.3 



5 5 434809 393566.1 
438129 450500.1 
438713 463722.1 



442473 543413.1 

442506 54405.1 

75 445182 632151.1 

447703 733191.1 



AW901492 AW947725 AW901448 

AW947577 AW947574 AW947576 AW947734 AmiV33 AW947732 

R24601 R23657AA194467A1948584AI678666 AA194383AI765219AA702993AA813511 AA620965A1990303AI624B82A1003925 
AI338870 AI004689 AI004690 AI127228 Z25302 F29302 BEQ44308 F32992 AA112966 T30825 F24958 F18071 C00537T30841 R24502 
A1934786 AI770076 AI144132 AA81 2597 AW203978 H82735 AI81 3349 AI14290a C04894 AI208243 A1208044 N89963 A1767866 A1290470 
Ai865963 H83611 BE463806 AI082639AA507760 

NM.00441 5 AL031058 M77830 BE149760 AW752599 AW848723 AW376697 AW376B1 7 AW376699 AW848371 AW376782 AW848789 
ATO61413AW849074AW997139AW799304AW799309BE077020BE077017BE185187AW997196BE156621 BE179915 BE006561 
BE143155 AW890985 BE002107 AW103521 AA857316 AW383133 BE011378AW1 70253 BE185750 AW886475 BE160433 J05211 
BE082576 BE082584 BE004047 AW607238 AW377700 AW377699 BE082526 BE082505 BE082607 BE08251 4 AW178000 AW1 77933 
A1906935 AW747877 AW7481 1 4 BE14851 6 AW265323 AW847678 AW847688 AW365151 AW3551 48 AW3S51 53 AW3651 56 AW365176 
AW365157 AW365154 AW063840 BEC05272 AW365145 BE0ai925 BE182166 BE144243 BE001923 AI951 756 AI434518 BE184920 
S49M AI284090 BE184941 AW804674 BE184924 C04715 W39488 AW995615 BE184948 BE169646 AW606653 AA099891 AA131 1 28 
AA337270 AA340777 AW384371 AA862212 R58704 AW366566 AW364859 AA025851 AA025852 AA455100 AA719958 AW352220 
S45Te165351 BE07M^^ 

S^mmh^^/mwm ami AW674920 N57176 AA026480 AW576767 H93284 AA026863 AW177787 AA026664 
S86Ba92mS2137BEro^ 

M36ffiB4AW951576^ H90899 N29754 W32490 R20904 BE167181 BE 67165 N84767 

22^3^90^9^003378 AI554403AI205263AA128470AI^^^^^ 

AW798883 AW798569 R33557 AA149190 C03029 AW177783 AA088866 AW370829 AA247685 BE002273 ^^^^^^ 

AI700963 AA45 1 923 AI340326 A1590975 T48793 A1568096 All 42882 AA039975 A14701 46 AA946936 BEC67737 BE067786 W1 9287 

Sl M7 Sl75^^^^^^^^ 

A1915596AI627383AI270345AW021347AW166807AW105614A1346078AA552300 W95070AI494069AI911702M1^^^^^^^^ 
AI830049 AI887258 AW780435 AI910434 A1819984 A1858282 A1078449 A1026g32 A18605M A1635878 M^^^ 
AW1920a5 AA65ai54 AW51 4597 AW591892 T87181 AA782066 AW243815 AW150038 "83 AW^^^^^^^ 
AW473233 AI804485 AW169216 A1572669 AA602182 AW015480AW771865«270027M^^^^^^^ 

AI783914 H44405 AW7991 1 8 AA128330 AA515500 AA91B281 W02156 AI905927 AA022701 W38382 R20795 T77861 AW860878 

BE046254 BE046673 BE046253 
BE046280 BE046763 BE045676 

BE153275 BE153189 BE153329 BE153022BE153030BE152974 

BE1 58687 BE1 58688 

T64682 BE168190BE16a256 

BE178160 BE177986 BE178330BE178480 

H01581 HI 2850 R65905H13053 

AA173029 BE467711 AA176710 

AA1 78882 AA1 79898 AA1 78897 



AA244416AA244401 
AA279362AA454496AA584871 

AA297546 AA297410AA297401 AA297465 AA297268 AW966174 
filoil^''™'iil)88^^^^^^^ 

AlM0687A1206a4 AI961336 T64873 AI736669 H58826 AA857710 AW337576 BE242131 AA339340 AA371380 AA334618 AA316398 
S4Sa™mSVS2AW190386A1129278A19130B1AW473549^^^ 

SsVs^e AW198135 A1287510 A1565910 AA765775 AI86M 
AW339175 r 

AI017717W„ 

S85 r^5800 AW370684 AAg0460B AA806352 AA894757 AW068376 AA807284 AA311341 AW95B314 
AW974687AA64g656AA652145 

2?L'oTS™?LS23569 -39869 r04737AI538427A1051714AW172603 

AI568978 H13317 R20373 AW948724 AW946744 AA335023 AA436722 AA448690 C21404 A™84390 MM^^ 

S78Ari975AlS^ 
S4 H79O A 80 4I M^^ 
M911493A™ 

A1696593 AI658482 AI566601 AW072797 AA1 28047 AA,C35592 AW243274 AA99251 7 R43760 

AI277548 
W27992AF056988 

BE566411 AL121194AW976385AW366882AI767324 

AW189787 Ai215430 AW268499 AW205930 A1392907 BE093017 BE093019 BE093010 
A1420277AW747989 W26565 

214 



wo U3/003906 



PCT/US02/21338 



454412 
454666 
25 454717 



455476 
35 455591 



1230516.1 
1265385J 
1289247.1 



1337197J 
1374629.1 
397222.1 
850804.1 



R76981 M101801 AA101722M122297 M098802BE006483 AA343200AW963375BE090425 AA213871 BE621402 M309094 AW960171 
«275045 Z44230AW243724Al051487Al376624l«8631AW978550a)0116Al984051M122265Al379941AW205843AW205839 
AW190219 AW129532 A1954133 A1668869 A1383948 AI537386 AA213788 A1088416 AI360826 AA101802 AI619505 AA101795 AA101723 
AW614392 R642B7 AA927599 AI251904 A1803003 A16751 23 AW023079 AW134959 AI926156 AA831 069 AI638324 F29901 R450a5 R)3383 
C17731 R20584 T90131 

AI033745 AI034133 AA844424 AW166024 AI831699 AI971097 AA01 1685 H70852 H70851 

Mml^P^^9Pm^mAA■32m^ AI954235 N7IO35 N7O23C AW674412 A1871136A1563955 AI954237 AA649543 AI34023I 
A1368586 A1868721 W4448B T83736 AA126250 A134361 9 H94297 T47633 A1672897 AA496355 R23240 A1814680 AA9021 19 AA6442ffi 
N67Q40 AW074273 A1357512AA865354 AI027942 R33837 H95828 N63928 AI418701 A11864ra AA693OT^^ 

C00016 AA861 853 AW022016 A1955645 A17531 18 A1755Q95 AA029523 T70086 AA029458 AW675640 N79606 A1659597 A141 71 19 A18Q4089 

AI383091 AA128493T83g07T47632AA094118AA496405 H94191AW887759 N98454AA512988Al623761 AW028373AW249740 

A1187029 AA991733 AI6830a5 A1336036 D60550 D60466 D60009 D60248 AW262673 A1524080 AA205M3 W969141 a™749^^^^ 

R81248 R73069W01019 R23315C14688 W44485 AA093663T70157 H57883 R94790 D81287 rai788AM643«AM83633 m^^^^ 

BE2951 44 D60973 D81466 D60551 R33836 D60249 AA354560 W01933 T71478 AA51 5461 F27984 BE084745 W19084 R81247 AI080252 

SM"3M8f9f30AA412478AA828434AA814538Al927418Al192435 W52897AA443666AA031913Al 

D83907 A1206832 AA876122 D83836 D83838 D82633A1761290 A1191125 A1143749 AVV771909 AI241436A1757267 "^SimimVnS! 

AA568692T10502AI247870AA715017AA643304AA890233AA811387AA897470AA907729AI708679AI078010AA452830AW^^^^^^^ 

A1783713 N80205W56778AA676899A1888718 N69930A1338935A1217580AA639508AA575836Kp^^^^^^^^^ 

AA643838 AI493761 AAD32024 W38849 AA340178 AA447052 AA452969 W19369 AA296364 H44229 W58767 C05751 C05B35 A1741989 

N98532AW102617AA412583A1922246 W38495AA355375AA928571 C06275AA352500 N93132 

AW582568 AW818666 AW818647 AW818655 AW818637 AW818234 

AW807095AW807022AW845880AW807096AW807461AW846116AW807070 

AW816123AW815138AW815259 

R41084AWB75856 

aSm aSii 8 aSto AW948093 AW948084 AW948088 AW948074 AW94B079 AW9481 17 AW948085 AW948081 AW94B096 
S 2S S S S^aS 
AWq48 2iSo68aKo9S810^ 
AW94M91aWM8?16S78^ 

Ar948nLw948178AW948169AVV94B176AW948191 AW948192AW948186AW948184AW94B187AVV948188AW 
AW948177 AW948171 AW948183AW94B173 
BE008018 BE008025 BEOOBO26BE0O7959BE00. 
BE00801 5 BE008021 BE008028 BE008023 BE0C8O3O BE008014 
BE011183 BE011170BE011333BE011188BE011181BE011324BE011161 BE011169 
BE145837 BE145894 
AW975000 AA668945 AA661 558 

AW?76f8oZ7B212A1909464AW17621BAW 



arsGenbank Identifier (Gl)nu 
L' Dunham I. el al., Nature (1999) 402;4B9495. 
Strand: Indicates DNA strand from which exonswere predicted. 
NLposlllon; Indicates nucleotide positions of predicted exons, 



55 400752 
400772 
400773 

400789 
60 400835 
400843 
400844 
4C0845 
400846 
65 400847 



401038 
401041 
70 401058 
401155 
401177 
401196 



75 



401286 
401346 
401371 



8131629 
7342055 
8307741 
8954121 



9433503 
9719573 
9743388 
9801342 



101102-101223,101886-102018 
198991-199168,199316-199548 
36215-36461 

34896-35021,41078-41197 
44116-44238,48208-48321 
184369-184715 

82281-83693 



....■5970,7663-7784,8892-9023,9673-9807,10634-10789,15254-1540123827-23958 

24746-24872,25035-25204 
34428-34612 
39310-39474 
44643-44835 



33694-33872 
147036-147318 
12031-13032 
80901-81283 



wo 03/003906 



PCT/LS02/21338 



401383 
401495 
401512 
401563 
401572 
401593 
401600 
401603 
401613 
401654 
401660 
401720 



402144 
402190 
402191 
402329 
402368 
402371 
402378 



7622346 Plus 

8247910 Plus 

8670271 Minus 

7230957 Plus 

4388746 Minus 

7689963 Minus 

4878062 Plus 

9097132 mnus 

9100664 Minus 

6468551 Plus 

7249190 Plus 

7139700 Plus 

3808091 Plus 

9501818 Plus 

8117546 Plus 

7704985 Plus 

7242328 Plus 

8576067 Minus 

8576073 Minus 

7798735 Plus 

9558577 Minus 



155543-157381 

82790-83002 

136399-136557 

91395-91763 

78651-78889 

10368-10572,11293-12356 
27363-27518,28727-28891,: 



22461-22831 
64695-64797 
173662-174024 
7783-8468 

148362-148606,149453-149535,149731-149962 

142257-142742 

79215-79393 

68052-68223 

137069-137213,138678-138828,138969-139050 

14173-15108 

115425-115977 



69410-69583 

15833-16180,18419-18715,22507-22624 
47218-47330,48052-48203 

68736-68956 

41312-41468,48313-48720 



402617 
402712 
402731 
402737 
402754 
402750 



6434643 Minus 

8086844 Minus 

3132346 Plus 

9799833 Plus 

8440026 Minus 

8516120 Plus 

7233487 Minus 

8569773 Minus 

9438267 Minus 



7530978 
3169112 
8218035 



7530858 
3135305 
3135305 



404438 
404439 
404530 



5835-5987 
69466-69945 
10941-11138 
117913-118004, 
13358-13552 
15345-15852 
136829-136952,137336-137521 
25026-25091,25844-25920 
160451-160617,160788-161009 
138639-139335 
194384-194645 
78385-79052 
25034-25185 
110959-111122 
96450-96598 
42312-43750 
167816-168374 
26009-26178 



1110-121211,121327-121457,125478-125623,126540-126663 



86737-86843 

34481-34671 

162963-165773 

121197-121358 

53259-53524 

100742-100904,101322-101503 

16251-16462 

108716-111112 

99397-101808 

47310-47450 

178075-178383 

79868-80321 

61077-61322 

36099-36212,37928-38075 
97789-98285,99601-99866 
122664-122931 
61006-61187 

26030-2617327852-27997 
29738-29857 
104042-104232 
53413-63553 



3157-3304 

104807-105043 

100027-100399 

18677-18993 

128456-128565 

71776-71862,72885-73019,73700-73822,74692-74850 
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404751 
404769 
404790 
404906 



405172 
405180 
405193 
405194 



405418 
405436 
405443 
405474 



7530939 
8099713 
7230958 
7331453 

7337343 
3738341 



9966302 Plus 

9966302 Plus 

9966752 Plus 

7139743 Plus 

7230072 Plus 

7230072 Plus 

7249293 Minus 

2276192 Mirus 

1552533 Plus 

6997292 Plus 

7408068 Minus 

7408143 Plus 

8439781 Plus 

9211311 Minus 

4512267 Plus 

7159256 Plus 

4508116 Minus 

4914350 Plus 



7767812 
8389537 
9168231 



7417725 
7417725 
7543787 
7107907 
9256305 



113799-114252, 

175801-176823 

38611-38761 

100985-101126 

55883-56203 

43081-43229 



217163-217439 

157134-157430 

150740-151556 

130469-130723 

161171-161299 

6461-6845 

40526-40891 



119867-120372,120481-120824,121029-121357 

28S75-29099 

51839-51953 

55716-55979 

90716-90887,101420-101577 
172005-172175 



169232-1B9647 
4758547688 

103218-103291,105858-105993,110051-110126 
741-969 

66203-66832 

224961-226780 

29424-29764 

60377-60795 

54789-55457 

41062-41861 

82322-83110 

123525-123713 

37764-37877 

49887-50219 

16586-16B55 

17206-17641,17772-17968 



182212-182958 

5446-5574,6170-6352 

142034-142473 
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TABLE 10 



Pkey: 



: Genes preterenlially expressed in non-invasive bladder tumors 



Unique Eos probesel identifier number 
ExAccn: Exemplar Accession number, Genbanl< accession number 
UnigenelD: Unigene number 
Unigene Title; Un'igenesene title 

R1 80lh percenUle of Ta lumor Ate divided by Die 80lli percenllle of T2-T4 tumor Ala 



Pliey 
421110 



UnigenelD Unigene Tille 



A1732617 
AA503115 
BE379727 



417275 X63578 



Hs.326444 
Hs.1 82362 
Hs.l 83752 
Hs.83213 



Hs.166196 
Hs.1 23933 
Hs.283675 
HS.8490S 



Hs.83760 
Hs.76549 
Hs.13500 



ESTs 

microseminoprolein. bela- 
fatty acid binding protein 4, adipocyte 
NM J03105':Hon!io sapiens sortilin-rdaled 
guanine deaminase 

C21 0001 52:gi|6226483|sp|Q521 1 8|YM03_ERWS 
Insulin-like growth factor 2 (somalomedi 
solute carrier family 14 (urea transport 



NM.003105*:Honio sapiens sortllin-relaled 
Target Exon 
parvalbumin 
Target Exon 



ENSP00000231844*:Ecotropic virus Integra 
ATPase, Qass I, Vpe 86, member 1 
ESTs, Weakly similar to ZI>igi_HUAMN ZINC 
NPDOOQpro " 



NPD009 protein 
cytol(eratin20 



Homo sapiens HUT1 1 protein mRNA, partial 
' Homo sapiens cDNA FU 12978 fis, clone NT 
Target Exon 

solute carrier fam9y 2 (facllllaled glu 
li4yoD family inhibitor 

troponin I, skeletal, fast 

ESTs, Weakly similar to A561 54 Abl subst 



420729 AW964897 



401512 
446B47 
417094 
436293 
436246 
447578 
417381 
426028 
431448 
437181 
415025 
412610 



T51454 

NM.006895 

AI601188 

AW450963 

AA912347 

AF164142 

Ni^oomo 

AL137517 
AI306615 
AW207091 



AF071202 



413804 
432306 
405364 
414320 
401929 



Hs.82845 
Hs.81182 
Hs.120910 
Hs.1 19991 
Hs.136585 
Hs.82042 
Hs.172028 
Hs.306201 
Hs.125343 
Hs.72307 
Hs.74126 
Hs,139336 
Hs.l 21 575 
Hs.188684 

Hs.61265 

Hs.303090 

Hs.75893 



hypothetical protein DKFZp56401278 
Target Exon 

Homo sapiens cDNA; FLJ23523 fe.ctone L 
ESTs 

NH^_003105';Homo sapiens sortilin-related 
deafness, autosomal dominant 5 
ti|M_014080;Horno sapiens dual oxidase-like 
Homo sapiens cDI«\: FU21930 fis, clone H 



ESTs 
ESTs 

ESTs, Weakly similar to JC5314 CDC28to)c 
solute carrier family 23 [mideobase tra 
a disintegrin and metsHloproteinase doma 
hypothetical protein DKFZp56401278 
ESTs, Weakly similar to KIAA0758 protein 
ESTs 

m add binding protein 6, Ileal (gas 
ATP-bifiding cassette, sub-family C (CFTR 
Homo sapiens cDNA FU12231 fis, clone MA 
ESTs, Weakly similar to PO)259 ferrilin 
gb:iim30f03.xl NCI_CGAP_Thy4 Homo sapiens 
ESTs, Moderately similar to G786_HUMAN P 
gb;yo48b02.r1 Stralagene liver (937224) 
protein phosplialase 1, regulatory (inhib 
ENSP00000239138*:Guaninenucleotkle-bind 
ankyrin 3, node of Fianvler (ankynn G) 
C17001690:gi|6005701|r6f|NP_009099.1|AT 
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TABLE 10C . 

Pkey; Unique number corresponding to an Eos probesel 

Ref: Sequence soune. The 7 digit numbers in this coiuran are Genbank Identifier (Gl) numb 

sequence of tiuman chromosome 22." Dunham I. et al, Nature (1999) 402:489-495. 
Strand; Indicates DNA strand from which exons were predicled. 
NLposilion: Indicates nucleotide positions of predicted exons. 



n 1. ei al." refers b llie publication entitled The DNA 




Nljosilion 
36215-36461 

5863-5970,7653-7784,8892-9023,967J-9807,10634-10789,15254-15403,23827-23958 



22335-23166 



3167-3288,4216-4310 

29782-29932 

54958-5531'3 

78385-79052 

26009-26178 

119837-121197 

138360-138512,144656-14' 



43081-43229 

48325-48491,49136-49252 
38115-38691 
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TABLE 11A: Genes preferentially expressed In musde-invasive bladder tumors 

Pl(ey: Unique Eos probesel identifier number 

ExAccn: Exemplar Accession number, Genbanlt acaiesslon number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 80tli percentile of T2-T4 tumor Als divided by ttie 80th percentile of Ta tunnor Als 

Pkey ExAccn UnigenelD Unigene Title 

423961 D13666 Hs.1 36348 periostin (OSF-2os) 

421948 L42583 
401780 

446619 AUC76643 

444381 BE387335 



NM_00555r;Homo sapiens keratin 1 6 (fboa 
secreted phosphopiolein 1 (osteopontin, 
ESTs, Weakly similar to S64054 hypotheti 



408243 Y007B7 

414183 AW957446 

411573 AB029000 

414522 AW518944 

41 3083 AL035737 

441633 AW958544 

427337 Z46223 

420859 AW468397 

422168 AA586894 

418870 AF147204 

421116 T19132 

448429 D17408 

414020 Nmi_002984 

447526 AL048753 

424247 XI 4008 

456625 AW468397 

418007 Ml 3509 

406663 U24683 

425593 AA278921 

433336 AF017986 

425118 AU076611 



445921 AB012113 

433470 AW960564 

417880 BE241595 

446500 U78093 

413324 V00571 

436729 BE621807 

450455 AL117424 

413731 BE243845 

412429 AV650262 



Hs.624 

Hs.301711 

Hs.70823 

Hs.76325 

Hs.75164 

Hs.1 12242 

Hs.1 76663 

Hs.100000 

Hs.1 12408 

Hs.89414 

Hs.1 01 850 
Hs,21223 
Hs.75703 
Hs.303649 
Hs.234734 



Hs.31386 
Hs.l 54672 
Hs,78944 
Hs,73817 
Hs.l 12341 



420899 NWL001629 

400288 X06256 

437446 AA788946 

413441 AI929374 
404854 

431319 AA873350 

452432 AW206008 

429679 NM_006290 

428330 L22524 

408380 AF1 23050 

431103 11^57399 

422545 X02761 

418203 X54942 

409956 AW103364 

406687 M31126 

414369 IW62194 

417259 AW903838 

417497 AW402482 

449335 AW150717 

445033 AV652402 

427274 NM_00521 1 

427527 AI809057 

4C9142 AL136877 

453331 AI240665 

428036 AW068302 

417366 BE185289 



Hs.82848 
Hs.15154 
Hs.75294 

Hs.26035 
Hs.75511 
Hs.75765 
Hs.83942 
Hs.83968 
Hs.l 001 94 
Hs.1 49609 
Hs.l 01 302 
Hs.75367 



KIAA1 077 protein 

step II splicing factor SLU7 

chitinase 3-like 1 (cartilage glycoprote 

Fc fragment of IgG, low affinity lllb, r 
SI 00 calcium-binding protein A8 (calgran 
SI 00 calcium-binding protein A7 (psorias 
chemoklne (C-X-G motif), receptor 4 (fus 
Target Exon . 

retinol-blnding protein 1, cellular 
calponin 1, basic, snnoath musde 
small inducible cytokine A4 (homologous 
small inducible cytokine A2 (monocyte di 
lysozyme (renal annyloldosls) 
SI 00 caldum-binding protein A8 (calgran 



proteoglycan 1 , secretory granule 
secreted frizzled-related protein 2 
methylene tetrahydrofolate dehydrogenase 
regulator of G-protein signalling 2, 24k 
small inducible cytokine A3 (homologous 
protease inhibitor 3, skin-derived (SKAL 
small inducible cytokine subfamily A (Cy 
transmembrane 4 superfamiiy member 1 
selectin L (lymphocyte adhesion molecule 
sushi-repeat-containing protein, X chrom 



chiorkle Intracellular channel 4 
connective Ussue growth fedor 
GR02 oncogene 
cathepsin K (pycnodysostosis) 
Integrin, beta 2 (antigen CD18 (p95), ly 



Hs.75929 
HS.8180C 
Hs.82212 
Hs.345728 



integrin, alphas (llbronectin receptor, 

ESTs, Moderately similar to CA1 C FiAT COL 

Srolike-adapter 

Target Exon 

ESTs 

Homo sapiens cDNA: FU21778 fs, done H 
tumor necrosis factor, alphaJnduced pro 
matrix metalloproteinase7 (matrllysin, 

diubiquitin 

pleiotrophin (heparin binding growth (ac 

fibronectin 1 

CDC28 protein kinase 2 

inhibin, beta A (activin A, activin AB a 

matrix metalloproteinase 1 1 (stromelysin 

cadherin 1 1, type 2, OB-cadherin (osteob 

chondroitin sulfate proteoglycan 2 (vers 

CD53 antigen 

STAT induced STAT inhibitor 3 
mucin 13, epithelial 'jansmembrane 
colony stimulating factor 1 receptor, fo 
Immunoglobulin heavy constant mu 
SMC4 (structural maintenance of chromoso 
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406755 
426663 
443623 
443907 
422048 
410204 
438973 
420202 
422626 
442402 
413902 
434868 
407207 
438855 
430413 
20 424909 



15 



N80129 
AA530892 I 
M345519 1 
AU076484 1 
Nf«L012445 I 
AJ243425 
AW95g5D3 
AL0365S7 
AA344932 
NM.000954 
AU076743 



Hs.1174 
Hs.82647 
Hs.297681 
Hs.94360 



HS336780 
Hs.6441 
Hs.241392 
Hs.1 53752 



Je N-methyllransferase 
cyclin-dependent kinase inhibitor 2A (me 
retinoic acid receptor responder (tazaro 
serine (or cysteine) proteinase inhlblto 
metallothionein 1L 
dual specificity pliosphatase 1 
complement component 1 . q sulwomponent, 
TYRO protein tyrosine Idnase binding pro 
sponiin 2, extracellular matrix protein 
early growtli response 1 
ESTs, Weakly similar to serin protease w 
putative lymptiocyte GO/GI siMitctigene 
metallotliionein 2A 

prostaglaidin D2 syntiiase (21kD, brain) 
CD36 antigen (collagen type 1 receptor, 
collagen, type VI, alpha 2 
tubulin, beta polypeptide 
Homos^ens mRNA; cDNA DKFZp586J021 (» 
small inducible cytokine A5 (RANTES) 
cell diviskm cyde 25 




414420 
415149 
415213 
421848 
452516 
427676 
415198 
424390 



AI742501 
NM_001425 
NM_002118 
AA043424 
XI 2451 
NM_002933 
XI 5880 
AA058630 
AA394062 



Hs78056 

Hs78224 

Hs,108885 

Hs.29759 

Hs.300772 

Hs.943 

Hs.1 82241 

Hs.297753 



epithelial membrane protein 3 
major histocompallbillty complex, dass 
immediate early response 3 

cathepsin L 

ribonuclease, RNase A family. 1 (pancrea 
collagen, type VI, alpha 1 

RNA POLYMERASE I AND TRANSCRIPT RELEASE 
tropomyosin 2 (beta) 
natural killer cell transcript 4 



Pkey: Unique Eos probesetldentlfler number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



AiSSfifiSlffl^ H97060 W03910 H94687 TB3984 AL048165 T29632 

ASSsr9:5SiSS^^ 

A f4M5™ 

tuol^l H^^I R RF wqqmsi 13 ^ D58229 C04082 W45394 AW796667 R73973 

Awfinfin74 AWISSSflq ^^944 AA665817 AW192979 AW469065 AA564048 H84715 C16417 AA731072 AA661674 C16487 N2S477 
S997Srci64yiM65^^^^ 

Alfi?«52Srai S78AW Z'«l"9 C16577 AA885045 AW073763 R45484 

^ S4S^lS6T^^^^^^^ 

AW793910 AL047737 AV659047 AV659632 A1760389 AA092053 AA092798 H85367 T61597 R23745 Z2041B T7B485 AI751 528 AW068121 
AA8531 88 AI752459 AA85371 1 AW950663 R78964 R36359 R21 626 R21 522 
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TABLE 11C 



AW960564AA092457 T55890 D66120 T92525AI815987BE182608 BE1B2595 AW080238 M90657 AA347236 AW9616B6 AW176446 
AA304671 AW5B3735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 D82638 W25540 W52847 
D82729 D58990BE619182AA3151B8AA308636AA112474 W76162AAOB8544H52265AA301631 H80982AA113786BE620997AW651691 
AA343799 BE613669 BE547180 BE546656 F1 1933 AA376800 AW239185 AA376086 BE544387 BE619041 AA452515 AA001 806 AA190873 
AA180483 AA159546 F00242 A1940609 A1940602 AI189753 T97663 T661 10 AW062896 AW062910 AW062902 A1051622 AI828930 
AA1 02452 A1685096 AI819390 AA557597 AA383220 a:B04422 AI633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 
AA08361Q AW6291 09 AW51 3200 AA921 353 A1677934 Al 1 48698 AI95585B AA173825 AA453027 A1027865 AW375542 AA454099 AA73301 4 
AI591384R79300 R80023AA843108AA625058AA844898AW375550AA889018A1474275AW205937A1052270AW388117AW38B111 
AA699462 A1242230 N4747B H38178 AA366621 AA1 13196 AA1 30023 H39740 T61629 AI885973 AW083671 AA179730 AA305757 AI285455 
N83956AA216013AA336155AW999959 T97525AA345349 T91762AA771981A1286092AI591386BE392486BE385852AA682601 
AI6B2884AA345B40T85477AA29294SAA932079AA098791 D82607 T48574AW752038 C06300 

BE621807 AI445461 A1346836 A14E3743 AI554544 A192B364 AW984527 BE156214 AI694111 AI591358 C17504C17476 C17963 C18304 
AWD71 625 AI678712 CI 7732 D57559 H6 " 762 A72G939 AI262930 H27252 AA47971 2 AI927769 AA291 466 AA155661 A1963432 AI567995 
AA421678 AI925607 M292956 AA192448 AW1 92593 AI865838 AI696905 AI871950 AI91 1 921 BE619741 BE439796 A1161312 AI597801 
AI424384 A109351Q A1240988 AWB20230 A1492554 BE044033 AW262737 AW00857O AA043216 AW629505 AA1 36645 AA037722 AA706C57 
AA088439AW806193AW806183AA479834BE501957AA129574 R38114AA649494AA524526 BE327120 AW572531 BE219784BE349186 
AW015724 AA043217 AW772000 AI79981 4 AI671727 AW779725 AA502832 AI470033 AA1 29575 W3B1 61 A1972739 AA404570 AA627686 
AA723200 AA1 47228 AA903050 AI990245 AI075878 T32487 C061 23 AA1 57944 A1B001 06 W60075 A1859 160 AA478328 AW673162 
AA182640 AI99D827 AW275048 AW103470 AI298935 AW471421 R791 90 AW085158 W4541 0 A1333170 AW300456 AA662517 T55840 
A1823466 A1692846 AA962397 AW1 91 997 AA1 36658 AI251 817 BE0441 34 AW3391 04 AW517762 AA724739 R79933 AA41 1 00 Ml 91 349 
AAD37696 AA190966 AA757735 AW772283 AA01C631 HB0983 A1769516 H64985 A1061O65 A1950693 AA085492 AI245632 H28594 
AW088968 BE156360 AI349390 A1621 320 AI73e844 AW1 B4272 AA1 48284 AA953883 C06365 AA487893 A192721 7 A191 8523 A1453453 
A1798502 AI189366 AI261359 AI032569 AW338678 A1972899 A1500576 A1B72628 AI693030 Z28771 ^'986583 AI363B29 AW 
AA581 093 AI650338 W60032 AA603686 A1686240 AW242958 AA719173 A174571 7 AW675302 A1582462 A1244845 A1565439 F09679 

S751oT8a640A1468237AW3B4233AA989662AI^^^ 

A1348877 A1798415 D11921 A1377696 A1983666 AI744233 C061 1 1 f^^^^J^?*^^,^^^^ ,,,, , „„ aa41 0586 148869 
Al240665 T53681 N77468 H51B33AA147247 R75732C18450 R73999A!095755 T49g04H03868AA411580 R33395AA410586T48B69 

S2R31981H12M6^^^ 

A1378620 T48870 R73906 R75632 H03612 AA9096B4 N50695 H02580 H12B39 N58781 AA742532 AI360919 H03502 BE208298 R68588 
S63SaW069127AA411621R25671 R36105H12451 HC3869 H51263AA036486 R25109 R25110AA147 



Pkey; Unique number corresponding to an Eos probeset ^ ^ , 

Ref Sequence source, Thr7 digit number In this column are Genbankldenliller(Gl) numbers. "Dunhaml.e 

sequence of human chromosome 22." Dunham I. el al., Nature (1999) 402:489-495. 
Strand; Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons, 



1.- refers to the publication enlitled The DNA 



Pkey Ref Strand NLposlkir 

401780 7249190 Mnui 

401781 7249190 Mlnui 
404854 7143420 Plus 



28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573 
83215-83435,83.^- - , o.,., o«««m(. j> 
14260-14537 
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TABLE 1 2A: Genes prefefenlially expressed In inuscle-lnvaslve bladder Uiinois 

Pkey: Unique Eos probesetldailllier number 

ExAccn; Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title; Unigene gene lilie 

Seq ID No,: Sequence idenlilication Number linking Infcrmallon In Table 12A to sequences In Table 13 



NM.002206 
X0782O 
M13509 
AW388e33 
M21305 
AF0560B5 
AA321649 
AW248508 
NM_005940 
U31875 



429276 
428227 
410102 
425247 
424735 
420159 
415511 A1732617 



408908 
443171 
432829 
410553 



421110 
408243 



Hs.1 82362 
Hs. 108106 
Hs.752g4 
Hs.1 43655 
Hs.1 37007 



AW016824 

U60669 

AC002115 

Y13645 

BE001836 

AK001666 

AW450979 

AJ250717 

Z43948 

Y00787 



AKD01379 
AI133461 
T646B2 
A1126096 

AI149332 



Hs.1355 

Hs!624 

Hs.125395 

Hs.72222 

Hs.89404 

Hs.166196 



H5.14855 
Hs.1 80665 
NM_007019 Hs.93002 



427239 

424008 
414907 
421379 
428023 
417079 

422247 
449722 
423673 
418478 



Unigene Title 

Integrin, alplia 5 (fibronectln receptor, 
matrix melalloproteinase 10 (siromelysin 
matrix melalloproteinase 1 (intersHflal 
solute oan-ier famfy 7, (oalionio amino 
FGENES predicted novel secreted protein 
G prctein-coupted receptor 51 
small Inducible cytokine subfamily B (Cy 
ESTs; homologue of PEM-3 [Ciena savignyi 
matdxmetallopiQlehase 11 (siromelysin 
short-chain alcohol dehydrogenase family 
Homo sapiens cDNA: FU21245 fis, done C 



ESTs 



9b:an03c03,x1 Stralageneschizo brain S 



TONDU 

ESTs 



hypolhetical protein IVIGC14128 
cyloohrome P450, subfamily XXW (vitamin 

uroplakin 1A 
uroplakin 2 

ESTs, Weakly similar to dJ365012.1 |H.sa 
similar to SALL1 (sal (DrosophilaHHie 
gb;UI-H.BI3-ala-a-1 2-0-Ul.s1 NCI_CGAP_Su 

cathepsin E 

cartilage acidic protein 1 

interleukin 8 

ESTs 

hypollietical protein FU13459 

msfi (Drosopliila) homeobox homotog 2 

ATPase, Class I, type 8B, member 1 

C1002652*:gi|544327|sp|a04799|FTVtO5_IWBIT 

apolipoprolein L 

death receptor 6, TNF superfamily member 
NM.003105*;Homo sapiens sortilln-related 
NM_022819*:Homo sapiens phospholipase A2 
hypothetical protein FU10549 
Insulin-like growth laclor2 (somalomedi 
gb:yc48ba2.r1 Stratagene liver (937224) 
FGENESH predicted RNaseH domain-containi 



Hs.293317 
H6.261677 
Hs.54451 



• protein E2-C 
Predicted cation efflux pump 
C19a00735':gi|4508027|ref|NP_003414.1| 2 
down syndrome critical region protein DS 
ubiquitin carrier protein 
interferon-stimulated protein, 15kl3a 
putative chemokine receptor; GTP-binding 
polo (DrosopMa)-IIke kinase 
small Inducible cytokine subfamily B (Cy 
a disintegrin and metaHoproteinase doma 
interleukin 1 receptor antagonist 
C12000536';gi|6330167|dbj|BAA86477.1|(A 
solute carrier family 1 (high afjnity a 
cyclin 81 

matrix metaHoproteinase 12 (macrophage 

cyclin-dependent kinase inhibitor 2A (me 

G protein-coupled receptor 87 

hyaluronan synthase 3 

cancer/lestis antigen (NY-ESO-1) 

ESTs, Weakly simlar to GGCI.HUMAN G AI4T 

G anilgen 7B 

lamlnin, gamma 2 (nicein (lOOkD), kalini 



Seq ID No. 9 & 10 
Seq ID No. 11 & 12 
Seq ID No. 13 & 14 
Seq ID No, 15 & 16 
Seq ID No. 17 & 18 
Seq ID Na 19 & 20 
SeqlDNa21&22 
Seq ID Na 23 -25 
SeqlDNo,26&27 
Seq ID Na 28 8 29 
Seq ID Ma 30 
Seq ID No. 31 & 32 
Seq ID Na 33 
SEK! ID No. 344 35 
Seq ID Na 36 S 37 
Seq ID Mo. 38 & 39 
Seq ID No, 40 S 41 
Seq ID No, 42 8 43 
Seq ID No, 44 8 45 
Seq ID No. 46 8 47 
Seq ID No. 48 8 49 
Seq ID Na 50 & 51 
S«| ID No. 52 
Seq ID Na 53854 
Seq ID No. 55- 60 
Seq ID Na 61 8 62 
Seq ID Na 63 8 64 
Seq ID No, 65 8 66 
Seq ID Na 67 8 68 
SeqlDNa 69 8 70 
Seq ID No, 71 8 72 
Seq ID Na 73 8 74 
Seq ID Na 758 76 
SeqlDNa 77 a 78 
Seq ID No. 79 8 80 
SeqlDNa 81 8 82 
Seq ID No. 83 8 84 
SeqlDNa 85 8 36 
SeqlDNa 87 -89 
SeqlDNa 908 91 
SeqlDNa 92 8 93 
SekiIDNo.94895 
Seci ID Na 968 97 
Seq ID No. 98- 100 
Seq ID Na 101 8 102 
SeqlDNa 103 8 104 
Seq ID No. 105 8 106 
SeqlDNa 107 8 108 
Seq ID No. 1098 110 



Seq ID Na 
Seq ID No, 
Seq ID No, 
Seq ID No. 
Seq ID No, 
Seq ID Na 
Seq ID No. 



118112 
138 114 



120 & 121 
1228 123 
124 8 1 25 
126-133 
SeqlDNa 134 8 135 
Seq ID No. 1368 137 
SeqlDNa 138- 141 
SeqlDNa 1428 143 
Seq ID No. 1448 145 
SeqtDNa146 8 147 
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433817 
421478 
431515 
433169 

432441 
425726 
448045 
431846 
437044 



NM.002497 Hs.1 53704 



409103 
129404 
403047 



AW292425 Hs.163484 ESTs 



AJ297436 Hs.20166 

BE019924 Hs.271580 

AL035864 Hs.69517 

BE387335 Hs.283713 

AA242758 Hs.79136 

BE153855 Hs.61460 

AI127076 Hs.306201 

AF104032 Hs,184601 



ESTs, Modaralely similar to S29539 ribos 
EDG-7 (endothelial diRerenllatlon,lys 
klnesMke protein 2 



BE270266 Hs.82128 

NM_014400 Hs,11950 

BE0621C9 Hs.241551 

NM.001944 Hs,1925 

AF251237 Hs,112203 
A1267700 

BE246502 H5,9598 



ESTs, Weakly similar to S64054 hypotheti 
LIV-I protein, estrogen regulated 
Igsupetfamily receptor U4IR 
hypothetical protein DKFZpS6401278 
solute carrier femily 7 (catlonic amino 
gap junction protein, lieta 5 (connexin 3 
mfdkine (neurile growSi-promol/ng Cactor 
EGF-like-domaIn, multiple 6 
periostin (OSF-2os) 
5T4 oncofetal trophoblast glycoprotein 
GPI-anchored metastasis-associated protein homdog 
dibride channel, calcium activated, MIy member 2 
desmoglein 3 (pemphigus vulgaris antigen) 
XAGE-1 protein 



SeqIDNo. 148 & 14 
Seq ID No. 150 & 15 
SeqlDNo,1S2&15: 
Seq ID No. 154 8 151 
SeqIDNo. If- - - 
Seq ID No. 16 
Seq ID No, 1£ 
SeqIDNo. 16 
SeqIDNo. 16 
Seq ID No. 16 
Seq ID No, 16 
Seq ID No, 16 
Seq ID No. 17 
SeqIDNo. 17 
Seq ID No. 17 
Seq ID No. 17 
SeqIDNo, 17 
Seq ID No, 18 
SeqIDNo, 
Seq ID No. 
SeqIDNo. IS 
Seq ID No. IB 
Seq ID Mo. 19 
SeqIDNo. 19 
Seq ID No, 19 
SeqIDNo. 196 & 197 
Seq ID No, 198 & 199 
Seq ID No. 200 & 201 
Seq ID No. 202 
Seq ID No, 203 & 204 
Seq ID No. 205 & 206 



184 & • 



Unique Eos pmbesetidentirier numl»r 



Accession: Genbank accesskxi nu 



Pkey CAT Number 

413804 1390710J _ 

427239 27647.1 BE270447 AW409921 BE207288BE2C7170 D56355 BE263223BE40B171 BE262243BE392439BE29273BBE261776BE314300BE2B7719 

BE268715 BE513876 BE295291 BE297066 AA210923 BE407519H51344 BE622905 AW248281 AW250313 T19021 AA355115 AA316879 
BE269633BE621936AA290724AA380031 AA9S361d BE301331 W79220 H73020AW797850BE274611 BE314315BE542407BE562063 
BE312868 BE27BB17 BE252S03 AA426470 14191670 NM 014501 BE264533 AI884863 AI884g42 AI678077 AA829937 AI869333 AW273060 
AA906924A1571293AW172642AA582906AW674596AW300936AW615753AW615587AW665495AI859296AA594105AA928110 
AI038859 AI589898 AA828316 AA938955 AA28371 1 AW170665 BE047759 AW73212a A)015067 AW070748 AW248955 AA975490 
AA21 1097 AI826097 AA826730 AAgg4072 AI367867 W74508 AI0B6474 AA8Z7720 AI92258g AW000868 W58033 AA991 461 BE0481 31 
liei21BAA876186AA661759AI123879AI089508AW272915Al566244AW245061AA68B299AW250988Al953468AI891144AW089131 
A1471577AA293354AM64019 BE044549 T29587AW958171 BE537716BE269400AA918328A1538087 AA969243AW075033 BE139361 
AI3405B9AI260128AI247038BE138953AW075177AI30720BBE049086AW302327AI054335AI316666AI334881AI252076 AI264494 
AW075006AW302733AW302738AI054057AI054217A1054172AJ054302AI053722AI054060A1054079AW075181 A1307473A1312145 
A1340734AI334909AW071374BE138502AW074809AW301901AI251662AI307559BE139228AI254764AW073049A1251254A1802837 
AW271867AI307442AW075100AW073456AW072496AI270787AW271039AW071307AI610913AW071289AI251232AI261262 
AW073656 AW072901 AI307493 AI255068 AI2S1289 AI252160 AI271496 AI2524Z7 AW073469 AW071420 AI2701S6 AI252926 AI252839 
A1252868 AW072520 AW073433 AI340643 AI802854 AI334733 AI583896 AW071 31 1 AI802853 AI28971 1 A!345(B6 AW07251 3 ^348921 
AI307478/\1305762BE138315AW271034AI334886AI340619AW470478BE139260BE138965AW3020B5BE041872AW268964 

432222 343347 1 AI204995AW827539AW969908 AW440776AA52B7S6 

436608 42361_3 AA628980AI126603BE504035 

438817 46559^.1 AI0Z37g9AA826307A1683Og4AI3D7373AI870547AW97gaO7 
444163 593658_1 A1126098A1184746AI148521 

456034 142696 1 AW4S0979 AA136653 AA136656AW419381 AAg84358AA492073BE168g45 AA809054AW238038 BE011212BE011359BE011367 
BE011368 BE011362BE011215 BE011 365 BE01 1363 

TABLE 12C , 

Pkey: Unique number corresponding to an Eos probeset 

Ref; Sequence source. The 7 digit numbers in iWs column are Genbank Identifier (Gl) numbers. "Dunham 1, et al." refers to the publicatton entitled "The DNA 

sequence of human chromosome 22,' Dunham I, et al., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were piadfcted. 
Ntj)ositk>n: Indicates nucleob'de positions of predicted axons. 



Pkey Ref Strand Ntjjositlon 

400844 9188605 Plus 24746-24872,25036-25204 

401093 8516137 Minus 22335-23166 
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402305 7328724 Plus 40832-41362 

404440 7528061 Plus 80430-81581 

404875 9801324 Plus 96588-96732,97722-97831 

405033 7107731 Minus 142358-142546 
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HHt,i4i^L.uuuC GCTATGGGGA GCCGGACGCC k^r^,^^-.-. — ^ 

G CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCIG TTGCTGCTCG 
TGCUGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCKA GCSGTACTCT 
CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 
GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCRG CCAGCCAGGA GTGCTGCAGG 
GTGGTGCIGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 
TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG GGAGAGGAGC 
CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCX3AGCCCAT GGCTCCTCCA 
TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 
TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAOIAT GCACCCIGCC 
GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCITC AGTGCCX3AGT 
20 TCACCAAGAC TGGCOGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCIGG CAAGGCCAGA 
TCCTGTCTGC CACTCftGGAG CAGATTGCAG AATCTTATTA CCCCX3AGTAC CTGATCAACC 
TGGTTCAGGG GCAGCIGCAG ACTCGCCAGG CCAGTTCCAT CTATGATGAC AGCTACCTAG 
GATACrCTGT GGCTCSITGGT GAATTCftGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 
TGCCCAAAGG GAACCTCACT TACGGCTATG TCACCATCCT TAATGGCTCA GACATTCGAT 
25 CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA GTGGCCGCCR : 

CAGAOSTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG CTCATGGATC 
GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCTG CAGCACCCAG 
CCGGCATAGR GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGRGTTT GGCCGATTTG 
GCAGCTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT GTGGCCATCG 
30 GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT GGGGGCCCAG 
GAGGGCTGGG CTCIAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 
CAGACITCTT TGGCTCTGCC CTTCGRGGRG GCCGAGRCCT GGATGGCAAT GGATATCCTG 
ATCTGATTGI GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAGGGGC OGCCCCATCG 
TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG GAGCGGAGCT 
35 GCAGCTTAGA GGGGAACCCT GTGGCCTGCR TCAACCTTAG CTTCTGCCTC AATGCTTCTG 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGOMSAAGC 
AGAAGGGAGG GGTACGGCGG GCACTGTTCC TGGCCTCCAG GCAGGCAACC CTGACCCAGA 
CCCIGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC TACCTCAQGA 
ACGAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT CGCTCTCAAC TTCICCTTGG 
40 ACCCCCAAGC CCCAGTOGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 

GCCGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCIGTGTGC 

CTGACCTGCA GCTGCAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 
CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGGG TGGCGCCTAT GAGGCTGAGC 2100 
TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACaC COWGGAACT 2160 
45 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG COGCCTGCTG GTGTGTGACC 2220 

TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGG CCTTCGGTTT ACAGTCCCTC 2280 
ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCT^GC AAGAATCTCA 2340 
ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG GCCCAGGTCA 2400 
CCCTGAACGG IGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 
50 GAGACCAGCC TCAGAAGGAG GAGGACCIGG GACCTGCTGT CCACCATGTC TATGAGCTCA 2520 
TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 
TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC ACCACCAATC 2640 
ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CflCCAGCAAA 2700 
AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATCCTG AAATGCCCGG 2760 
55 AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 2820 

GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2B80 
TTAGQCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA ATCCTGCCTC 2940 
GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC AAGGCAGAAG 3000 
GCAGCTATGG CGTCSCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 
60 TAGGICTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAA03CTCC CTCCCATAIG 3120 
GCACCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCTGATGCC T^GTCCTCC 3180 
CAATTTCaGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CTGAAAAGGA 3240 
GGGGICTGGG TACTTCTTGA AQGTGCTGAC GGCCAGGGAG AAGCTCCTCT CCCCAGCCCA 33 00 
GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 33 60 
65 GCACTGTGAA GGRCCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA CTCAGATCCA 3420 
GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT GAAACAACTG 34 80 
GAARGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC AAGGflCTTCC 3540 
TGTCCAGCTC CAACKTGCAA AGATCTGTCC TCAGCCTTGC CA6AGATCCA AAAGAAGCCC 3 6 00 
CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCIGGACAG CCCCACCCTG 3 660 
70 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GAQCAGCTCA GGACAGATGC 720 
CACAAGGATA GATGCIGGCC CAGGGCCAGA GCCCftGCTCC AAGGGGAATC AGAACTCAAA 3 7 BO 
TGGGGCCXGA TCCAGCXTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GACATTGGCA 3 840 
CCAATCCAGG C^GATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3 900 
AG??CA™CT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCIGAAGGGG CCATGATGGC 3960 
7S AftCAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCICTTCCCC AGAGAAAGGG 4020 

^gcc^Sg T^CCT GCAGAATTTG GGTTCTGCCT GCXIAGCTGCA CIGATGCTGC 4 8 
CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACXCCCAG GACTTATTTA 4140 
AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCX: ACTGACCAGA ACTAGAAAAA 4200 
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I I I I 1 I 

MGSRTPESPL HAVQLRWGPR RRPPLVPLLL LLVPPPPRVG GFNLDAEAPA VLSGPPGSFF 
GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP lEFDSKGSRL 
LBSSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTBKEPLS DPVGTCYLST 
DNPTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYPKQ GQILSATQEQ 
lABSYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 
GYVTILHGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 
EVGRVWYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDODGYNDV AIGAPFGGET 
QQGWPVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 
VDKRWYRGR PIVSASASLT IPPftMPNPEB RSCSLEGMPV ACINLSFCLN ASGKHVADSI 
GPTVELQLDW QKQKtKVRHA LFLASRQATL TQTLLIQNGA REDCREMKIY LENESEPRDK 
LSPIHIALNP SLDPQAPVDS HBLRPALHYQ SKSRIEDKAQ ILLDCGEDNI OTPDLOLEVF 
GEQNHVYLGD KNALNLTPHA QNVGBGGAVB AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 
FAVNQSRLLV CDLGNPMKAG ASLWGGLRPT VPHLRDTKKT IQFDFQILSK NLNHSQSDW 
SFRLSVBAQA QVTLNGVSKP EAVLPPVSDW HERDQPQKEE DLGPAVHHVY BLINQGPSSI 
SQGVLBLSCP QALBGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKaEAPSRS 
SASSGPQILK CPEAECFRBR CBLGPLHOQB SQSLQLHPRV WAKTFLQREH QPFSLQCEAV 
YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGI. LLLGLLIYIL 
YKLGFFKRSL PYGTAMBKRQ LKPPATSDA 



Seq ID NOi 3 DMA s 



Lagaaggta AGGGCAGTGA GAATGATGCR TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGRTCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC ASITTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCCTGACGT-l- GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACqCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG MTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA QGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGRCT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGR 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGA^C=EE! 

GGIGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG '^TCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG A«GATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCR TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCQTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGBAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGA.AAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 13 2 u 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA ^TGGATCATC 138 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 

G™S??^ T^aCCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA ISOO 

ATTATTCATC TAATOTATTA TCAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 

^SaAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT ™TCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAM ^TGAGAGA 1680 

ATGTATTTTC ATAGATGTOT TATTACTTCC TCAATARAAA GTTTTATTTT GGGCCTGTTC 1740 



MLAFLVLL LpVCSAYPL SGAAKBBDSN KDLAQQYl^aC YYNLEKDVKQ FRR^SNLIV 
KMQGMQKPL GLEVTGKLDT DTLBVMRKPR CGVPDVGHPS SPPGMPKWRK THLTYRIVNY 
TPDLPRDAVD SAIEKALKVW EEVTPLTFSR LYBGBADIMI SPAVKEHGDP YSFDGPGHSL 
I^YPPGPGL ?GD5iFDDDE KWTBDASGTN LPLVAAHELG HSLGLFHSAN TBiALMYPLYN 
SPTELAQFRL SQDDVNGIQS 1.YGPPPASTB EPLVPTKSVP SGSEMPAKCD PALSFDAIST 
LRGEYLFF^ R^F^RRSHWN PEPEFHLISA F«PSLPSYU) AAYEVNSRDT VPIFKGNEFW 
AIRGNEVQAG YPRGIHTLGF PPTIRKIDAA VSDKEKKKTY FFAADKYWRP DENSQSMEQG 
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QAPGPPYPPS GSSQPBPDPN ARMVTHILKS NSWLHC 



AGTAGCAAGA GGCTGGGAAG CCATCACTTA C 

CCCAGCGACT CTAGAAACAC AAGAQCAAGA TGT3GACTTA GTCCAGAAAT 
ACCIGGAAAA ATACTACAAC CTGAAGAATG ATGGGAGGCA AGTTGAA^oji- 
GTGGCCCAGT GGTTGAAAAR TTGAAGCAAA TGCAGGAATI CTTTGGGCT 
GGAAACCAGA TGCTGAAACC CTGAAGGTGA TGAAGCAGCC CAGATGTQG 
TGGCTCAGTT TGTCCTCACT G 



CCTTCCAACT CTGGAGTAAT GTCACACCTC T„„v.„. . ... - 

CAGACATCAT GATATCTTTT GTCAGGGGAG ATCATCGGGA CAO,CTCTCCT T 

CTGGAGGAAA TCTTGCTCAT GCTTTTCAAC CAGGCCCAGG TATTGGAGGG GATGCTCATT 

TTGATGAAGA TGAAAGGTGG ACCAACAATT TCRGAGAGTA CAACTTACAT CGTGTTGCGG 7 20 

CTCATGAACT CGGCCATTCT CTTGGRCTCT CCCRTTCTAC TGATATCGGG GCTTTGATGT 780 

ACCCTAGCTA CACCTTCAGT GGTGATGTTC AGCTAGCTCA GGATGACATT GATGGCATCC 840 

AflGCCATATA TGGACX5TTCC (»AAATCCTG TCCAGCCCAT CGGCCCACAA ACCCCAAAAG 900 

OSTGTGACAG TAAGCTAACC TTTGATGCTA TAACTACGAT TCGGGGAGAA GTGATGTTCT 96 0 

TTAAAGACAG ATTCTACATG CGCACAAATC CCITCTACCC GGAAGTTGAG CTCRATTTCA 1020 

TTTCTSITTT CTGGCCACAA CTGCCAAATG GGCTTGAAGC TGCTTACGAA TTTGCCGACA 10 80 

GAGATGAAGT CCGGTTTTTC AAAGGGAATA AGTACTGGGC TGTTCAGGGA CAGAATGTGC 1140 

TACACGGATA CCCCAAGGAC ATCTACAGCT CCTTTGGCTT CCCTAGAACT GTGAAGCATA 1200 

TCGATGCTGC TCTTTCTGAG GAAAACACTC GAAAAACCTA CTTCTTTGTT GCTAACAAAT 1260 

ACTGGAGGTA TGATGAATAT AAACGATCTA TGGATCCAGG TTATCCCAAA ATGATAGCAC 1320 

ATGACTTTCC TGGAATTGGC CACAAAGTTG ATGCAGTTTT CATGAAAGAT GGATTTTTCT 13 BO 

ATTTCTTTCA TGGAACS^GA CAATACAAAT TTGATCCTAA AACGAAGAGA ATTTTGACTC 1440 

TCCAGAAAGC TAATAGCTGQ TTCAACTGCA GGAAAAATTG AACATTACTA ATTTGAATGG ISOC 

AAAACACATG GTGTGAGTCC AAAGAAGGTG TTTTCCTGAA GAACTGTCTA TTTTCTCAGT ISSO 

CATTTTTAAC CTCTAGAGTC ACTGATACAC AGAATATAAT CTTATTTATA CCTCAGTTTG 1620 

CATATTTTTT TACTATITAG AATGTAGCCC TTTTTGTACT GATATAATTT AGTTCCACAA 16B0 

ATGGTGGGTA CAAAAAGTCA AGTTTGTGGC TTATGGATTC ATATAGGCCA GAGTTGCAAA 1740 

GATCTTTTCC AGAGTATGCA ACTCTGACGT TGATCCCAGA GAGCAGCTTC AGTGACAAAC IBOO 

ATATCCTTTC AAGACAGAAA GAGACAGGAG ACATGAGTCT TTGCKGGAGG AAAAGCAGCT 1860 

CAAGAACACA TGTGCAGTCA CTGGTGTCAC CCTGGATAGG CAAGGGATAA CTCTTCTAAC 1920 
ACAAAATAAG TGTTTTATGT TTGGAATAAA GTCAACCTTG TTTCTACTGT TTT 



51 



MHSPPPLLLL LFWGVVSHSF PATLETQEQD VDLVQKYLBK YYNLKNDGRQ VEKHRHSGPV 
VEKLKQMQEF FGLKVTGKPD ABTLKVMKQP RCGVPDVAQF VLTEGNPRWS QTHLTYRIBN 
XTPDLPRADV DHAIEKAFQL HSNVTPLTFT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 
LAHAPQPGPG IGGDAHFDED ERWTNNPREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 
TFSGDVQLAQ DDIDGIQAIY GESQNPVQPl GPQTPKACDS KLTFDAITTI RGBVMPPKDR 
PYMRTNPFYP EVELNFISVF WPQLPNGLEA AYBFADHDEV RFPKGNKYWA VQGQMVU^GY 
pmlYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 
GIGHKVDAVP MKDGFPYPFH GTRQYKPDPK TKRILTIX3KA NSWFNCRKN 



Seq ID NOs 7 DMA secruencei 



1 1 1 I 1 ' 

ATGGTCAGAA AGCCIGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 
SSCGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 
AAGAGGAAAG TCACITTACI GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 
GGAATCTTCA TCTCTCCTAA GGGOSTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 
ACCATCTGGA CGGTGTGTGG GGTCCTCTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 
GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 
TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 
GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 
CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 
AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 
GCAATTCTGA TAAITATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 
TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 
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TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT T 
AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 
TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 
AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 
ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG : 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 
TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 
GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 
TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 
CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGAClGGA 
GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAiiTG 
TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 
TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 
TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATA^iA CAAAAGGAGT 
CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 
AACTCTATST AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CA.!iTTCTTGA 
GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 
TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 
TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTaAAA.\GTG 
TTTTCAATTC TGAAAARAAG CATACATCAT GATTATGGCA AAGAGGAGAG A 
ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC A 
GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 
TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTMGT 2 ISO 
AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG G.AAGTTTTAG 22 20 
TTATCTGTCA ■ mrmTlT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2 2 80 
AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 
TACCCCTC3AT GAGTCTATCT AAACATATGC AITTTAAGCC TTCftAATTAC ATTATCAACA 2 400 
TGAGAGAAAT AACCAACAAA GftAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2 4 SO 
CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 
TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TRCATTTTAT ATTCTTCTGT TAAAATATCT 2580 
CTTCAGATGA AACTSTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 
AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 
ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 
CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 
GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 
GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 
CAACATGGAG AAACCCCATC TCTACTAAAA ATACfiAAATT AGCTGGGCAT GGTGGCACAT 3000 
GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGOS 3060 
GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACI 3120 
CCATCTCCAA A 



I 



I I I I I 

?RKPWSTI SKGGYLQGNV NGRLPSLGNK EPEGQEKVQL KEKVTLLRGV SIIIGTIIGA 
GIPISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 
LPAFVRVWVE LLIIRPAATA VISLAFGRYI LEPFFIQCEI PELAIKLITA VGITWMVLN 
SMSVSWSflRI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 
YGMYAYAGWF YLNPVTBEVE NPEKTIPLAl CISMAIVTIG YVLTNVAYPT TINAEELLLS 
HAVAVTFSER LLGHPSLAVP IPVALSCFGS MNGGVFAVSR LFYVASREGH t-PEILS^HV 
EKHTPLPAVl VLHPLTMIML PSGDLDSLLN PLSPAEWLFI GLAVAGLIYL RYKCPMIHEP 
PKVPLPIPAL FSFTCLFMVA LSLYSDPFST GIGPVITLTG VPAYYLPIIW DKKPRWFRIM 
SEKITRTLQI ILEWPEEDK L 

Seq ID NOt 9 DNA sequence 
Nucleic Acid Accession #: FGENESH 
Coding sequence: 1..546 

1 i' r r r i' 

ATGGCCTTGG GCAGCTCa3C CCCTGTGGCT TTGCAGGGTA ATGCCCACTT CCCTGCTG-T 
TTCATGGCTG GCATTAAGTG TCTGTGGCTT TTCCAGGTAG TCCCCCTGGG GCT J_(_LGal- 
TTGGTGCAAA GGCTCCTGGG TGGAGCTCGA ACTGAAACTC GCTTTQTGCC CGCAGCCCTG 
CAGCTCGCOG GTGCCCTOSA CCTGCCCGCT GGGTCCTGTG CCTTTGAAG.^ GAGCACTTGC 
GGCTTTGACT CCGTGTTGGC CTCTCTGCCG TGGATTTTAA ATGAGGAAGG CCAGCAACCT 
TTCTGGTCCT C3«3GAGACAT GTCTGACTGG GACTACTGGG TTGGCTGGCG GAAGTTAATT 
CATTCTCCTC TGAGCACTCC AGGGTGGAGC AGGCAGGTTA GGCTCCAGTT GTTCCAGCTT 
CAGTTTGTCA AAGGCCAGAA CTTGGACGTA ACAGTGTACT GCAGGCTCCA GGGCAGTGAG 
AAACCCTTTG AAACTGGTTC CATGGTTCCA TTCACCTTCA TGTACTGGAT CCaCCATGGA 
AAGTAG 
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MALGSSAPVA LQGNAHPPAA PMAGIKCLWL 
QLAGALDLPA GSCAFEESTC GFDSVLASLP 
HSPLSTPGWS RQVRLQLFQL QFVKGQNLDV 



FQWPLGLPE LVQRLLGGAR TETRFVPAAL 
WILNEEGQQP FWSSGDMSDW DYWVGWRKLI 
TVYCRLQGSE KEFETGSMVP FTEMYWIHHG 



1111 
u ftu^Au^^i^G CCTTCGCCTC CCGCGTTTCC TGCCGTCCGC CCTCCCCCGG SO 
ui.liAUl.Ti.CA GGGGCTGCCG CCTAGCAGCT CCCGGCGGGA GAGCGGTTCA GAGCGCGCAC 120 
GGGGGCGGGC GGAGGCGGCC CGGTGCGGGG CGGCCGCGCT GGAGAGAGGC GCGCGCGGAG 180 
ACGCCGGCCC CCCTCTCCGC GTTTGCTCGC TTGCTCCCCG CCTCCCGCAC TCCGCTCGCT 240 
CCCACCCCTT CCCGGOGTGA TTGATCCC3TC AQGGGCGCCG CCGCTGCCGC CGCCGCCGCC 3 00 
GCGGCCGTTC TGAGCCGAGC CGGAACCCIA GCCCX3AGACG GAGCCGGGGC CCGGGCCGGC 360 
GCCATTGCGC GGGCGCCGCG GGAAGACCTT GGOSCGGGGC GGCGGGCCGG GCCAGGCCAT 420 
GCGGGCCGAG TGAGCOMCG CCCGCAGCCC GOSGCGaSGC ATGGCTTCCC CGCGGAGCTC 480 
CGGGCAGCCC GGGCCGCCGC CGCCGCaGCK ACCGCCGCCC GCGCGCCTGC TACTGCTACT 540 
GCTGCTGCCX3 CTGCTGCTGC CTCTGGOSCC OGGGGCCTGG GGCTGGGCGC GGGGCGCCCC SOO 
CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC CTCATGCCGC TCACCAAGGA 660 
GGTGGCCAAG GGCAGCATOG GGCGCGGTCT GCTCCCCGCC GTGGAACTGG CCATCGAGCA 72 0 
GATCCGCAAC C3AGTCACTCC TGCGCCCCTA CTTCCTCGAC CTGCGGCTCT ATGACACGGA 78 0 
GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA ATAAAATACG GGCCTAACCA 840 
CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC ATCATTGCAG AGTCCCTCCA 900 
AGGCTGGAAT CTGSIGCAGC TTTCTTTTGC TGCAACCACG CCTGTTCTAG CCGATAAGAA 960 
AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT GCGGTGRATC CAGCXaVTTCT 1020 
GAAGTTGCTC AAGCACTACC AGTOGRAGOS CGTGGGCAOS CTGACGCAAG ACGTTCAGAG 1080 
GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT GGCGAGGACA TTGAGATTTC 1140 
AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC AAAAAGCTGA AGGGGAATGA 1200 
TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA GCAAAACTGT TCTGTTGTGC 1260 
ATACGAGGAG AACATGTATG GTAGTAAATA TCASTGGATC ATTCCGGGCT GGTACGAGCC 1320 
TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC CGCTGCCTCC GGAAGAATCT 1380 
GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG CCCCTGAGCT CCAAGCAGAT 1440 
CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA GAGTACAACA ACAAGCGGTC 1500 
AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT GGCATCTGGG TCATCGCCAA 1560 
GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC CGGCACCAGC GGATCCAGGA 1620 
CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC AATGCCATGA AOSAGACCAA 1680 
CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG GAGAGAATGG GGACCATTAA 1740 
ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA GAGTACAACM CTGTGGCCGA 1800 
CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA TCOSAACCAC CRAAAGACAA 1860 
GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT CTCTACAGCA TCCTCTCTGC 1920 
CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC TTCTTCAACA TCAAGAACCG 1980 
GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC AACCTTATCA TCCTTGGAGG 2040 
GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT GGATCCTTTG TCTCTGAAAA 2100 
GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC ACCGTGGGCT ACACGACCGC 2160 
TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC ATCTTCAAAA ATGTGAAAAT 2220 
GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC GTGGGGGGCA TGCTGCTGAT 2280 
CGACCTGTGT ATCCTGATCI GCTGGCAGGC TGTGGACCCC CTGCGAAGGA CAGTGGAGAA 2340 
GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC ATCCGCCCTC TCCTGGAGCA 2400 
CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC TATGCCTACA AGGGACTTCT 2460 
CATGTTCTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC GTCAGCATCC CCGCACTCAA 2520 
CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG ATCATGTGCA TCATCGGGGC 2580 
CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG TTCTGCATCG TGGCTCTGGT 2640 
CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC GTGCCGAAGC TCATCACCCT 2700 
GAGAACRAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC CAGTTCACTC AGAATCAGAA 2760 
GAAAGAAGAT TCTBASACGT CCACCTCGGT CACCAGTGTG AACCAAGCCA GCACATCCCG 2820 
CCTGGAGGGC CTACAGTCAG AAAACCATQS CCTGCGAATG AAGATCACAG AGCTGGATAA 288 0 
AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA GAAAAGACCA CCTACATTAA 2 94 0 
ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG GGAAACTTCA CTGAGAGCAC 3 0 00 
AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA AATCCCCAGC TACAGTGGAA 3 0SQ 
CACAACAGAG CCCTCTOGAA CATGCAAAC3A TCCTATAGAA GATATAAACT CTCCAGAACA 3120 
CATCCAGCX3T CGGCTGTCCC TCCAGCTCCC CATCCTCCAC CACGCCTACC TCCCATCCAT 3180 
CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGOSTCAGC CCCACCGCCA GCCCCCGCCA 3240 
CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC CTGTAAGGGT GGGAGGCCTG 33 00 
GGCCCGGGGC CTCCCCCGTG ACAGAACCAC ACTGGGCAGA GGGGTCTGCT GCAGA^CAC 33 60 
TGTCGGCTCT GGCTGCGGAG AAGCTGGGCA CCATGGCTGG CCTCTCAGGA CCACT-^-GAT 34Z0 
GGCACTCAGG TGGACAGGAC GGGGCSGGGG GAGACTTGGC ACCTGACCTC GAGCCTTATT 3480 
TGTGAAGTCC TTATTTCTTC ACAAAGAAGA GGAACGGAAA TGGGACGTCT TCCTTAACAT 3540 
CTGCAAACAA GGAGGCGCTG GGATATCAAA CTTGCAAAAA AAAAAAAAAA RAAAAAAAAR 360U 
CAAAAAAACT AGACAAGGAG AGAGGCACTA GAACTCCAGC TGGAAGTCAC GGAGTGGCTC 3S60 
GAGCAGCCTT GGGAAGAGGC AAGGAGCTTC TGAAGAAACT GCCTCTGCAC ACACATCACT 3720 
GGCTGTGACC CCTCAGGCTA GCCCTTCICC ACTCTGGGGG AGGAGGTGGG AAGGGCCACC 3780 
AGGCCCCCAG CTGCCAGGCC AGCTGACCCC SGCCTTCCTG GAACAGGGAG TCTGCAGGAG 3840 
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CGCAGACAGG CACAGCCCTG GAGCAGGCAG GCCGAGGGCT GCGGCRCTGG AGCAGGCTGA 3 

CTTACATGCT CCACATGGGA CCTGTGTCAC CCAATGAGAT GTTTGTTACT CTGGTAAATG 3 

CCACACGTTA ACACAATAAC ACCCATTCCT GGGACCGTGG GGATTTAGGG CAOSTCACTG 1 

CAGACACGCT CTGCAGCATT CACCGACAGT CTGTCATGCA CCCACXSICGT TGGCCATGTC 1 

CTTGTGTTCC TATCGGATGC TCCCAGTAAC CAGGGGGACC ACCCGAGCTA ATCATGGAAT 4 

GTCTGTTCCC AGCAAACACG ATAAAGAAAG ATTGTGCACT TTAACCTCTC TCATCAGGGC A 
CCAAGGGCTG GCTGGGATTT tTTTTTTTTT TTTCCCACTA ACTTTGTTTC TGACCAAAGT 

GAATTGGAGG CACTCTGCTA AAAGACATCC CCGTAGACAT AGGGGAGAGA GTTGCTGGCT A 

GAGGGCTTCC CTTGGCTTCC AGAAGGCAGC CTTCCATCCA GACAAGCCAG TGAGCTCTCC A 

CCTTGGGATC ACTGGGGTGA TCAGTCAGCA GATTGATTCT CATTCATAAG ATCATTCCTC < 

CCTTTAAATT GAGCCCCTAA GAGCACTGGC CTGGGAGTCA GACAGACCTG GGTTCAAGTC A 

:CACTCC CTACGTGACT TTGATCAGGT CACTAGTGTC TCTCTGAGCC ' 

C CTCTGTAACT TGGGGTTGAA CTAAAACACC TGTCCTGCCT ACCTCACAAG ' 

A GGATTGAAAC TTGATCTTGT CCAGGAAAGC TTTGTACCAA ACAGTGAAGC ' 

C CGTGAGGTAT GAGTATGACT CTGACCTTCA GCCCH'CCCTA CAGCCGGGGG ' 

TGTGGCCCAG AGAAGCTTCC AGCACAGCCC TCTACCCAGA ACATCCGGGC TGGAGGGAGG ' 

CTCCCAGTGA CTTTTCTGAC ATTCCTAGAC AGGTTCATTC TTTaCTCA.AG AAAGGCCTGA ' 

ATGACAATGT CCAGGATGTC TGCACAACTG AGCAGCTCGC TCACTCCCTA AAGAAACCTA - 

TTOGCAGCTT CAACMGCAG GCAAIAATCT CTTCCCAGAA CCACTGCAGT CAGGAATAAA - 

CTGTTTTCIC CACCAGGCTT TGACAAAAGG GCCCACAGGA ATCTTACCAA. TGCCAACATT 1 

TCAAAGCACC CTATTTCAOS TAGCATAGCT TTCTGCTCCC CTTCCCCAAA GAGAGGTTAT 1 

GGAGGTACTG TAGCTTTTAG GGAAAAAAAA ATGTTAACAC ATCACAGGTC AAGTTGAAGT 1 

CATTCTCTGT TTAGGCACTA AAAATCGGTG TTGTCACTCA CTGTCTATTA CCAGTATTTA ] 

CTTCCTTTCT TGATTTCACC AAAACCAAAT TTAATTTAAA GGACCACATT AATTTTTCAA ! 

AGGGRAAGAG ACAATTAATT GTACATAATG TATACACACA CACAAAAAAA AATACCTGTA 1 
GAAATATTAT TCCAGCATAG CAGGAAAACA AACAAAAGTA TTGGACTGTC GGAGGTGAGC 
CTGTGCGTCT GTAACCCTTT GTGA'CTCCTG AGCGTGCGCT GTCTTCTAGG TTAACTCACG 
AAGTACATTC TCTGTCTTAC TGATACTGTA GGTTCACCCA TTTTTTTTTA ATTTCCTCGC 
AAATAACAAG ACCCACAGAA GTGACTCTAG CTACTTAATG GTTCTGTTCT TTTATATGCA 

GCAAACACAC OSTCCaTTTC TGAAGAGGCT TCGGCCTGAA GGCATTTTCC AATGATGTTA SS40 

SIGCAC3«AA OGCTTTAAAT TAGACTGGAA CTGCCAGAAT CAflATGTAAA TGAGGAATTT 5700 



I I I I I I 

MASPRSSGQP GPPPPPPPPP ARLLLLLLLP LLLPLAPGAW GWARGAPEPP PSSPPLSIMG 
LMPLTKEVAK GSIGRGVLPA VBLAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 
IKYGPNHLMV FGGVCPSVTS IIABSLQGWN LVQLSFAATT PVLADKKIWP YPPRTVPSDN 
AVNPAILKLL KHYQWKHVGT LTQDVQRFSE VRNDLTGVLY GEDIEISDTB SPSNDPCTSV 
KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI IPGWYBPSWW BQVHTEANSS 
RCLRKNLLAA MEGYIGVDFE PLSSKQIKTI SGKTPQQYER EYNNKHSGVG PSKFHGYAYD 
GIWVIAKTLQ EAMETLHfiSS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQWPRNG 
ERMGTIKPTQ FQDSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 
LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 
GSPVSEKTPE TLCTVRTWIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 
VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLBHCBN THMTIWLGIV 
YAYKGLLMLF GCFLAWETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ 
PCIVALVIIP CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 
NOASTSRLEG LQSENHRLRM KITELDKDLE EVTMQLQDTP EKTTYIKQNH YQELHDILNL 
GNPTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 



Seq ID NO: 13 DNA s 



GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACfi GCAGAGGAAC CTCCAGTCTC 
AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCIGAC TCTAAGTGGC 
ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT ThGThK^ChA 
CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCA.=lQCCA ATTTTGTCCA 
CMTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGPA 
TCX3AAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AJ^GATCTCCT 
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 
GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 
GGTTAATGTT C^TCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 
GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCSGAATTC 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 
CATACAATTC CAAACACATA C^GGAAGGIA GAAATATCTG AAAATGTATG TGTAAGTATT 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 
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TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 

TTTTCAAATA A 



1 1 I I 1 I 

GCGCAGRCGC CCCAGCCCXC CACCGCCCCC AAAGGGGCGA GCGACGCCAA GCTCTGCGCT 60 

20 CTCTACAAAG AGGCCGAGCT GOGCCTGAAG GGCAGCAGCA ACACCACGGA GTGTGTTCCC 120 

GTGCCCACCT CCGAGCACGT GGCCGAGATC GTGGGCAGGC AAGGCTGCAA GATTAAGGCC IBO 

TTGAGGGCCA AGACCAACAC CTACATCAAG ACACCGGTGR GGGGCGAGGA ACCAGTGTTC 240 

ATGGTGACAG GGCGACGGGA GGACGTGGCC ACAGCCCGGC GGGAAATCAT CTCAGCAGCG 30 0 

GAGCACTTCT CCATGATCCG TGCCTCCCX3C AACAAGTCAG GCGCCGCCTT TGGTGTGGCT 360 

25 CCTGCTCTGC CCGGCCAGGT GftCCATCOST GTGCMGGTGC CCTACCGCGT GGTGGGGCTG 420 

GTGGTGGGCC CCAAAGGGGC AACCATCAAG CGCATCCAGC AGCAAACCAA CACATACATT 460 

ATCACACCAA GCCGTGACCG aSACCOCGTG TTCGAGATCA CGGGTGCCCC AGGCAACGTG 540 

GAGCGTGCGC GCGAGGAGAT OSAGAOSCAC ATCGCGGTGC GCACTGGCAA GATCCTCGAG 600 

TACAACAATG AAAACGACTT CCTGGOWGG AGCCCCGACG CAGCAATCGA TAGCCGCTAC 660 

30 TCCGACGCCT GGCGGGTGCA CCAGCCCGGC TGCAAGCCCC TCTCCACCTT CCGGCAGAAC 720 

AGCCTGGGCT GCATCGGCGA GTGOSGAGTG GACTCTGGCT TTGAGGCCCC ACGCCTGGGT 780 

GAGCAGGGCG GGGACTTTGG CTACGGCGGG TACCTCTTTC OSGGCTATGG CGTGGGCAAG 840 

CAGGATGTGT ACTACGGCGT GGCOGAGACT AGCCCCCCGC TGTGGGCGGG CCAGGAGAAC 900 

GCCACGCCCA CCTCCGTGCT CTTCTCCTCT GCCTCCTCCT CCTCCTCCTC TTCCGCCAAG 960 

35 GCCCGCGCTG GGCCCCCGGG CGCACACCGC TCCCCTGCCA CTTCCGCGGG ACCCGAGCTG 1020 

GCCGGACTCC CJ3AGGCGCCC CCCGGGAGAG CCGCTCCAGG GCTTCTCTAA ACTTGGTGGG lOBO 

GGCGGCCTGC GGAGCCCCGG CGGCGGGCGG GATTGCATGG TCTGCTTTGA GAGCGAAGTG 1140 

ACTGCCGCCC TTGTGCCCTG CGGACACAAC CTGTTCTGCA TGGAGTGTGC AGTACGCATC 1200 

TGCGAGAGGA CGGACCCAGA GTGTCCCGTC TGCCACATCA CAGCCACGCA AGCCATCCGA 1260 

40 ATATTCTCCT AAGCCCCGTG CCCCATGCCT CCGGGGCCCA CTCCACTGGG CCCAC3CCTGG 1320 

ACCTGTTTTC CACTAAAGCC TTTTGGAAAG CGGTGATTTG AGGQGCAAGG TGCTTAGAGA 1380 

TACTCGCTCG CTGGGGAAGG GGGGAGGGAG GCAGTGGTGG CTGGAGGGTG CGCCACTTTC 1440 

AGAGCCTCTG GTCACCCTGT CCTCGAAAGA TTGGGAGGGG GCCAGACTGA AAATTTTACT 1500 

AGAGTTACAA CTCTGATACC TCAACACACC CTTAAATCTG GAAGCAGCTA AGAGAAACTT 1560 

45 TTGTTTTGCC AGAGGTGGCC ACTAAGGCAT TCTGACGCCC TCTGCCCACC TCCCCCGCTG 1620 

TGTGTCACTC CACCCCTTCT TCCGAGGAGG GGGTGGGTAA AAGGGAGAGG GAGAATTACC 1680 

ACCTGTATCT AGAGGTGCTC TTTCCAATCC CTAAGCCCTC TGGTCCIGAC CTCCGACCTC 1740 

CTAACATGAC CCTTTACCTC CCACCCCACC CCCATATCCI GTTTGGGAAA CTGTCACCAG 1800 

TTTCCAGCAG TGTAAGGGAG TTGGAGTCCT ATCAGAAGTT GCATAGATCT TCTAGGGGTT 1860 

50 GGGGAGAGAA GCATGTCAAT CGTTTCTGTG GCTGAAAGGC TCAGAAGCCA TCTGTCCOCA 1920 

CAAAGCTGGG CTAGAGGAAT CTGGAGAGGA GTCCTCCTCT CTGCCCCTGT CCCCTGCAGT 1980 
GTTTCCCTTC ACTCTCTCCG CCTATCTTCC CTTCCTTTGG GATCTTCCCT TTCCTCAACT 
CTTTCCTTTC CCTCCAGCTC TTTGCTTTGC TTTCTTTTGG TGGCTGTCAC TCCCAGCTCT 
GTCTTGTTCC TTGTCTTTGT CTTTCTTCCC TTCCCCCTGC CCCTGCCCCT ACCAGCCCAG 
55 CTTTGGGGAC ACCATCCTTC TGGGGAGAAG TAGGGGGAGG AATATTTGGA TGGTCCCTCC 
S?CCTCTIC AGGCATCTGG AGGCCCTCTC CCCCAC^CT CCAAAGAAAC ATCTCAAATT 



2100 



ATTCCTCTTC AGGCATCTGG AGGCCCTCTC CCt:CAi-n.i--i- „x^iw™-* 
ATTGATGGAA TGTATCCCCA TTCTCAGTGA AAATGTGAGG AGGGGACTAA TACTGGGGTA 
AAGGGTCAAA CCCCCACCTT CATCACTATG GGCATTATAT TTAGGGAGTA GTTCTTGGGC 

Stttct ggtigtggaa gtgggggcgc cagagtagtg tgtctgctat ttaaaggagc 

AGGAAAGGGC GTGAGGCAGG AGGAGAGACT GGTGGAGGGA AGAGCTGCTC CTCCCATGCA 

OTG^ACT CCCTGCACCC CTCTCRACCT GACCTGAACC TTTATTGAAT CCTTATTAGC 2580 

TTGAATCCTT ATTAGCTTGA ATCCICCATG CAAATCATGG AGTCTGTGTC CCACCTGATG 2640 

TGGTTGAGGA GAAGCCAGGT CTTCRAAGAG GGGTCAGCCT GGGGCAAAGC AGGACTGGGG 2700 

GGAGGTGGGC AGCAGGGCCT ATTCTGAGAA TCACATATTG TTACAGGCCT TGCACCCCCT 27S0 

??GCTGCTTC CCTCCTGCTC ATTIGGGGCT GCCACCAGCT CTCCACCCTC CTGGTTCCGC 2820 

TGGCCGGGCC AAGAGAGGAT GGAGGGATGG GAGTCCCAGG AGATCCTTG7 AflATAGTGGG 2880 
GTGGGACTGT TCTCAGTGAT CACCCGAGCA CTTAAAGCTC CAGAGTCCCA TTCTTCCTGG 
ATGGAGCAGG TGGAGGTGCA GAGGGGATTT CCTCCTCTCC TTCCTCCTGT CGAGAATTAA 
CACCTCTCCA CAGCCTTCCC CTCXa^GAACA CCAGCCAGGG AGGGGTGGGG AAGGAGGTCA 
CAGCCAAGAA AACTGCCCTG TGACGACTTC CCTCCTTCCC GCCTATGTGA GCCATCCTGA 
GATGTCTGTA CAATAGAAAC CAAACCAAAT GGGCACCCTC GGTTGCCGGG ff^'^^ 

GGAGGGGGGT GGGAAGAAGG GATCTCTGTC TGTCGTCCCC CTCCCCCTCT CCACIOTTA 3240 

CCCACAAAGG CAGAAGACTG TTACACTAGG GGGCTCAGCA AATTCAATCC CACCCTTACC 3300 

AATTGAGCCA AACCTAGAAA CAAACACAAA ACACGAATAG TGAGAGACAA AATAGAGGAG 3360 

^IaAGAGAG CATGAGAGGG AGCGAGACAG GCGACCAACA CAGAGGAGAG AAAACAAAAA 3420 

TA^^AAAAAA AAAAAA.WA AGCAGTTCTT TATAATTTAA TATTCTATTT TAATAAAGGC 3480 

GTTTATTACC ATATAAATGT AGCAAAGAAC CTGGGCTAAT ATGAAAAAAA AAGACTTTTT 3540 

AT?aGGTAAT TTATTATATC AAAAGGATAT TTTATTTTAT GATAAAGTGA TCCTTAAAAA 3600 
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^ CTTTAGAAGG 
TTGTATTCAG AGTTAAATCT 
TTGCTTTTTT CCCCCACTTT 
ATCCCGGGGG TGAATGGTGG 
r GGATTTCRCT 



GACTGAGGAT CCTGGGGTGT C 
GGAGGGGGAA TCCAGAATTA AGGCCTAGCA GGCCTATAGG AACCCTCAQA GATGTGTGAG 
ATTTAAGAGA TCTAGATTTT TTTTTAACCA AAAACAAGAG AGAAAGAGAA GAAAA.AGAGA 
AACCGAGGGG TTTAAAAGAA AAGAATACTA CAAAATAATA ATTATTAATA AIAAT.iiATTC 
AAATTTATTT CATATAATCC TAGAGAGAGA AAGAAACAAT TACTAGTTAC TTAGTAGACA 
ATATTAAGAT AGCTTAAAGT TTAGTAGCAT TGAGGGCCCC TGGGTCCAQT AGAATGTATA 
AAAGTTGTAA GGAAAAGATA AATAGAGGAG GGAAGTGGCT GAGTCCACCC TGAGTTGCCC 
AATCTTCAGA TACCAGGGTT GGATCAGGTT GCTAGTTTAA GATTGGGAGC TTCCAGTCTG 
CTGGGGTTC3A TTCTGAGAAT CCTTGGATTT TTAAATTGTA GGACAAAGAA ATGAGGGGTT 
CATTTCCCAG GGTCTTGGAA AGGATGCACA CTGATCATCI CAATAAGACA GGGGCTGGGT 
TGGGGGCAGC AGAGGRGGCC AAGCACATTC ACCTGCACCC CTAGTACCTG GGCAGCCCAT 
ACTCCAATGT GGTATGTCCC CTCCTGGGGC TCCCAGCTCA AACCCTCCCA TGCCTGCTTC 
CCCCAGGCCT AACTGAGGAA GTCCTTCTTG AAGTGTGACC TCG3TCCAC7 TCTCTACAGA 
TTGATTTAAG AGCCTGGGAA GTCATTCCAC AAACAGACAC ACATGCACAC ACGCTTCTCA 
CCTTCAGAGC TTCASGAGCA CTGAGGCGAT CAGTCCCCIA CCCCTGTTCC CATCCAGCTT 
TCCACTTAGC TTTGACCTCC ATGGCAGCAG TAGCAGTAflC AATCTCAGTA ATTGTTCTTT 
AAAGCTGACT OSTTCTTCAC CTACTTGCAA AGTGCTTTCT TGTCTCATAA AAGTTAGATT 
CCAAGAAGGA CTTCCCACGG AGTGGAGTGG AAACACTGIC CTTGAAGGCC TGGGAGAAAG 
GCATCCCCAT GGGCACAGAG GCTGGGGAAA GGCRCAGGGA CTTTGGGTGA CCCTAACCCT 
GACCCTCTGC TCCAGTTCAC CTCCATCTAT ATSTOTTCAG GTAGGGGTCA TCTACTGTAC 
CCTGGCCTGG GAACACATTG CCCTCX:CCAC ACAAAACTGG AGGGCTTGGC TTCTGCGTGT 
GAGAAATCAA CATTTTTAAA GCACTTGCCT TCTACCAACC CCAGCTTGCA ATCACTGGGC 
CTTCCCCTCC TATCCAAGGG GTTGGAGGGG CCCCTTGGCT CTCCTTTTGG CAGGAGGAGC 
CTGCTTCATT ACACCAATGA CTCTGCCATC CCCCTCCCTG GCCCTAGACC CCAAACACAT 
CTCCCTCTAC CCAATTTACT CTTCICGCCC CACCTAGGGA CRGATTCCCC CTGCTCTTTT 
TGTCCTAGAA ACCCCGCTAG TTTGGGATGG TAGCGTCTGG GGTGGGGAGG GCTTCCCCTT 
CCCCACTCGA GGGTGCGGGT GGGGAAGGGG GGGTGGGTGG AGACAGCCCT GGGGCAGGGA 
GGATGGTCTC TCCACTGTAG AAAGTAGAGT AGGATTGTGG TCAGACTTAA TTTGAGGCAT 
CIAGTGAAGA CACGTACAAA TCCACCAAGG AAAAAGRTTT CAAAAGCAAA ATAAAAGCGG 
GAAATAAAAC AGACCCAAGA ATAATCAAGT CAAAGTGATG TTGCACAAAA TGCAGAGAAA 
CCAAGAAGGG GGAGGGTTAA TGTATTAAAT GTGCTATTAA GAACTTAATT TTATTAAAAG 
TACTATTACT TAAGGCTC 



1 I I I I I 
MVTGRREDVA TARREIISAA EHFSMIRASR NKSGAAFGVA PALPGQVTIR VRWx^,.>j,^ 
WGPKGATIK RIQQQTNTYI ITPSRDRDPV FEITGAPGNV ERAREEIETH lAVRTGKILB 120 
^m.^TUOAM SI.ncTGECGV DSGPEAPRLG 180 



WGPKGATIK RIQQQTNTYI ITfbKUKUFV ttii.iu/4e->Ji-iv n-^^i^ii.*^^" - 

YNNENDPLAG SPDAAIDSRY SDAWRVHQPG CKPLSTPRQN SLGCIGECGV DSGPEAPRLG 

EOGGDPGYGG YLFPGYGVGK QDVYYGVAET SPPLWAGQEM ATPTSVLFSS ASSSSSSSAK 

AEAGPPGAHR SPATSAGPEL AGLPRRPPGE PLQGPSKLGG GGLRSPGGGR DCMVCFBSBV 

TAALVPCGHK LFCMECAVRI CERTDPECPV C3JITATQAIR IPS 



Seq ID NO: 17 DN 
Nucleic Acid Acce 



Lgcccagca gccccggggc ggatggctcc ggccgcctgg ciccgcagcg cggccgcgcg so 

CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCA03CAG GAAGCCCCCC GGCCTGCCAG 240 

CSGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300 

CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCaCCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AQGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTI SOO 

CX3ACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACS^ ACAGCAGCCA AGGCCCTGAT 720 

GTCQGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780 

TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCaCCTCC AGGACCCCAG CCCTGGGCCC 840 

CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGAOSCCC CGCCAGATGC 900 

CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

RGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140 
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T GAGGTTCCaS GTCCATGCTG CCTTGGTCTG 
GGGTCCCGAG MGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 
CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA C3AGGGGTGCC 
CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACITCC TGOSaSGCCG 
CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCITCC CCCGTCTCGT 
T GTGCCGAGCC TGCCAACACT TTCCTCIGAC CATGGCTTGG 
: CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 
G CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 
C AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 
T TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 
GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT 
T GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT 

TGGGGCC CCATGGCCTT CAGCCCTGGC 

G GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 

ATCTGTCTGC CTTCIGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 
GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCRGGCC CTGGAGGCTG 2100 
CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 
ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 
TTTTTAAACT GAGGRTTGTC ATTAAACACA GTTGTTTTCT 



I 1 I I I I 

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 
PAPATQBAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 
LVQBQVRQTM ABALKVWSDV TPLTPTBVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 
PPPKTHREGD VHFDYDETWT IGDDCX3TDLL QVfiAHEFGHV LGLQHTTAAK ALMSAFYTFR 
YPLSLSPDDC RGVQHLYGQP MPTVTSRTPA LGPQAGIDTN BIAPLEPDAP PDACEASFDA 
VSTIRGELFF FKAGPVWRLR GGQLQPGYPA LASEHWQGLP SPVDAAFEDA QGHIWFFQGA 
QYWVYDGEKP VLGPAPLTEL GLVRFPVHAA LVWGPBKNKI YPPRGRDYWR PHPSTRRVDS 
PVPRRATDWR GVPSEIDAAP QDADGYAYFL RGRLYMKFDP VKVKALEGFP RLVGPDFPGC 
ABPANTPL 



I I 1 1 I 1 

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCIGCTG 60 

CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCIGCAA 120 

CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GGAATGftACA ACTCTGGACa 180 

CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240 

GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300 

TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAGACCAT 360 

CACCAP.GCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 

GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480 

TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGOS TCCTGGCTAA E40 

CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 

GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAAOS TGGACCGGGC 660 

CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCK ACGTGGGGAA 720 

GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGICGACTT 780 

CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 

GATCTGGGAC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900 

GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 

TSATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 

TMAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGOJ TGGTTCCAGG 1080 

AATTATAAAA ACTGACTTCA GCAAAGTGIT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1140 

CAAGGAACAT CATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200 

CTTCCTGTCC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 12 SO 

CTCCACTOSG CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 

CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 

AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TOSGCATTCT 1440 



LsAVARGYQ LpHPCARLS IrMSSTGIDR KGVLANRVAV VTGSTSGIGF MARRLARDG 



'Q GMPHPCTiRLS VRMSbTlilUK n.t.vumKv«v v looiLjvjivj^ 

AHWISSRXQ QMVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 
AGVNPLVGST LGTSBQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPW 
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCWPGIIKT DFSKVFHGNE SLWKNPKEHH 
QLQRIGESED CAGIVSFLCS PDASYVNGEK lAVAGYSTRL 
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I I I I I I 

GCCGCGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTGCGCGAGC AGCTGCTCCA 60 

ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG CTGGTAGCTC 120 

GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG AGTTCTACAA 180 

10 CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT TGTTTCGTCA 240 

GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG TTGTAGTGGG 3 00 

AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC TTGATGAACT 3 SO 

TGCAGTCCTT TGGGTTCTGA TGTCTGCTTT GGCCATGTGG TTCCCCAGAA GGTATCTACC 420 

AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC TGTCTGCGGT 4 BO 

15 TACGACGTGC CTGGCATTTG TCAAGCCTGC C=lTCAACAAC ATCTCTCTGA TGACCCTGGG 540 

AGTTCCTTGC ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA TGCGTGTGTT 500 

TAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT GGATCAGTGA 560 

CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT GCATGTGGCA 720 

CATCCTCATC TOCCTTGCTG CCTACCTOGG CTGTGTATGC TTTGCCTACT TTGATGCTGC 780 

20 CTCAOAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA AATGGGCCTT B40 

CATTGGTGTC CCCTATGTGT CCCTCCK3TG TGCCAACAAG AAATCATCAG TCAAGATCAC 900 

GTGATGGCAA GATGGTGGCT GGCTTCTCTG CITATOSCCC CTCATGCAGT GGGCTTCCTT 960 

TGCTAGGAAG ACAGCCAAGG GAGTTOSAAT AGTTGGGGTG TGGGCTATCT TTTCAAAAAT 1020 

CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTGTA TGTAGGGATT TAAACTTTGT 1080 

25 CATATGOTAC AAATATTCCC TGCCCCCCTG CRGTTTCCCA TTTGTCTTTC AGTATGTTAA 1140 

TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA TCTTTTCTTT 1200 

AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTCCTCCCT AATGTGTCIG 1260 

TGGACCACCT GGATTCCACT GTACABGGGG AAAAOTGTCT ATTCCTTTCC CAAAGATGGA 1320 

AAATGGAGGG CTTAGGGACA CTAGAIGCAT CTTTCTCAGC ATCACTTCCA GATGCAGTGA 1380 

30 CTTGTTGGGC TGCGTCCTTA ATGGCKATGG CAGAGCAGTC CCTTGGGGGA TCCAGCCCTG 1440 

T^CAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT TCCCAGGOSC 1500 

CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTGA GAGACAAGAC CCTCTGACTC 1560 

TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTO TCCTTAAACA AAACCAAACA 1620 

GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC ATTTCTCAAC 1680 

35 TAACCCAGGT TTTACATGCA TCTGIGAATC CTTTTACTAC TACCTCTGTG GAGAGATGGA 1740 

GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA CCTACACTCC 1800 

ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA TTTTTAAATA 1860 

CTGACATCAC TTCCTCTTCC CCCTCCCAGC CX^GCTCAGC AOCCTCAAAT CTACAGAGAA 1920 

GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG ACCTCTGTAA 1980 

40 GTTTACCAAA GGGCTCCTCA CAATTGTCGT CGGGGTTCTG GTTCRAAATT TGGAGCAAAC 2040 

ATGAAGTTTT TGGAAACGTT TTCTCATTIG AAGCCTCCAG TATGCTGTAC TATTCTGGAA 2100 

ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA TCATGTTCTT 2160 

CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC TCACCTTTCT 2220 

CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTCGCTGGTG CAGCAGOSCA 2280 

45 AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT GAATCITCTC 2340 

TTCRTCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG GGAATTATCR 2400 

ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC CATTGTTGGG 2460 

AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC CTACATATGT 2520 

AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA GTTAGAATTT 2580 

50 GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTSGAGTTTA AAAATGAATG 2640 

ACTTTATGCT ACATCTGTCG TTATCAAATT ATATAGGTTG TTGAOAAGCA GAAOSCTGTT 2700 

TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT TATTAAATTT 2760 

ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA AATGCAAATT 2820 

AACAAAITGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA TATGTTGGTT 2880 

55 TATTATAOTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG GTCTTGCTCT 2940 

ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT G.CTCCCAGG 3000 
CTCAAGTGAT CCTCTCACCT CAGCCTCCCC AGGAGCTGGG Ar.TArTr.r.TG CGTGCCT^CA 



CTCAAGTGAT tJdUTUAULi i^ouun-^-^^ rtvjvji-ivjs-xNj^^ 

CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCAT7TT GCCCAGG^TG 3120 

GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGI GCTQGGWA 3180 

ioCATGAGCC GTCACGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG AATTAAATAT 3240 

TCGTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA GAACTGCIGT 3300 

TAAGTCCTTT GGGTTTCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT CATTCTTATT 33 60 

TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCTCATCTA CTTTATGGTA TAAATGCTCC 3 420 

AAGGTAGGGO GTTTGGTATA TATTTTAAGC CCGGCTTTTT tTTTTTTTTT TTTTTTTTTT 3 

TTTTTAATQT GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAGAGA TGAGCTGAGA 3540 

AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA TAACTTCACT TTTGAACAGA 3 600 

AATCACTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT ACCCATCTTA 3 650 

TTTTAAGTTO GGTTTATGGC ATOSCTCATA TATGTAAAAG CACTACAAAC TCTTTAAAGA 3 720 

AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT TCTATTGCCC 3780 

AGGTATAATC CTTGTTAATA TTTTGSTTTG GTCTCCTCTT TTTTTCC.CC .WATAGTTG 3840 

TAAATAAATG ATGTCTTTCa GAGTTGACAT TTATCCTGTA GCTTGAAIGG CATGTAAATG 3 900 

CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAGITA GCTATATGCT 3960 

TGAATGCTGA TCACTGGATT CTGAGACIGA CTACTGAGTC TACCTTTTTA ATCAAGCCTA 4020 

ACATOAATOO GCTCCAAAAA GTAATGAATG TAATTQTACI TTTTGATGTG CCTCTGCACT 4080 

TGGCTTGQTG AOTCATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT TTTGATATGT 4140 

TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA GAATCTCCCT 4200 
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MGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISHV liFFIIiPPICM CLFRQYATCF 
NSGIYLIWTL LVWGIGSVY FHATLSFLGQ MLDELAVLWV LMCALAMWFP RRYLPKIFRN 
DRGRFKVWS VLSAVTTCIjA FVKPAINNIS LMTLGVPCTA LLIAELKROJ NMRVPKLGLF 
SGLWWTLALF CWISDRAFCE LLSSFNFPYIi HCMWHILICL AAYLGCVCFA YFDAASBIPB 
QGPVIKFWPN EKWAFIGVPY VSLIjCAMKKS SVKIT 



T CTGTTTAATG TCATATATCT GAGATGTTCT G 

C AGGAGGACAC ATTCTTGATC GGTATTTTAT TCTATTATTT TTATTAGTTC 

CAATCCTTAT GACTCTGTTT AAAATGAGTA TCTGCATTAT TGTGCAAACA CTTCTGAGAT 
CTCATCACAT ACCTGATCCC ATTTAACTTC AGTCTATCCC TCCATTACTT TTTGTGGAAA 
GTGCTATCTA AGTTGGAGSI TCTOCTTGAG TATTCTTAAT TTTTCTTCIT TTTAATGCAT 
ACGGTGGTCA AAGTAAATAG CTAACATATT GGCATCAAAC CATTGCCTAT TGAAGCAAGC 
TTTQAATGAA ATTGGTTTGT CATTGACCAC TGTAAGAAAC TCATTTGTAA AGATAATACT 
GAAAAACATC CIGACTCTTA TGTATOCTTG CCTGTCTCTG AACTCCACCT CTATCAGCAT 
ACTCAATACA TTGCCTCTTT CCAACATTTA TCTCCTATTT CAGGAGAAAG TTTTGTGGGA 
GCGTCTACCA OSTTGCAAAG ATTCAACATT TCTCTTAAAA ATAGGASITT CTTTTAGATA 
TAAGCAATGA AATTTCAAGC AACAATATTA CTTCTCTCAT GTCCTTTGCT TTTTTAAGAT 
CIGAACCTCA AGATTCAAAT GGATTTTCAT TGAAATTCAC CACGATCCCC TCTGCAGCTG 
TGGTCTGAAA GTACCATTTC TGCATTTATT TTAGCCCATG GAATAACTGT GCTGAGAAAC 
CACAGAGTCA ATCAGATTCA AAATGTTAAA ATCCTTCXTG CTTGGAGTTT TCCGTCTTCA 
CATCAAAGCA TTTCATGCCC GICAGCAACI TTTTAATGCA TTTGCTCATG GTTTGCAGAA 
TTTCCATTTA AAGACTTTCC TTGGCTGACT TCTCTGATGA QGTTTCCTGC TTGCCTC 



1 1 I I I ' . 

GTGTGTAGTC GGGTATGAGA ACGATTGCAA GCAAAAGCAG CTGAGGGAOS CTGTCCAAGG 
AGTAGGCTCA GTAGCACGAA CTGCAAATAC TTGAGAAGGG AAAAGTTCAG CCTTGTGCTG 
GAAAGTGGTT TAGCATGCTG GAGCTGGTTT TCTGCTTGGT AGCCCTACAA CTTTGGCCCA 
ACTACTTGGC CTCTGGGACT CAGATTCCTC CTCTTTAAAA TGGTGCTAAT AATAGCACCC 
ACCCTCTGTG AGGAGGATGC TGTGAGGACA AAATGAGATC ATCCACATAA GCCGTGAACC 
CTGTTCCTGG TAAGCTCCTG GAAAGAAGTC TATGGATTAI ACTCAACCTA CACTCCAGTT 
AAAGGAACAT CTACACACAG AGGAAATGAA TAACATGAAG TGAAGTCTTC ATCTCCATTC 
CCAACAGTCC CCATTCTACT TGCAGAAAGG TTGCTTACAC TGAAAATCAG TTTATTTTCC 
CCTGGTGCAA AGAACAGTCG TTTCTCCAAA ACTGAAGCTG GAAATTATCT GAAATATCAG 
GTCCTCCGGA AAAGGGACGT GAAGCCCCCT TTGTAATTTC TGCATTAGCG TGCTCTCCTG 
GCAAGCMGA AACCTCATCA GAGAAGTCAG CCAAGGAAAG TCTTTAAATG O^TTGTGC 
AAACGAGGAG CAAATGCATT AAAAAGTTGC TGACGGGCAT GAAATGCTTT SATGTGAAGA 
CGGAAAACTC CAAGCAGGAA GGATTTTAAC ATTTTGAATC TGATTGACTC TGTGGTIItl 
CAGCACAGTT ATTCCATGGG CTAAAATAAA TGCAGAAATG GTACTTTCAG ACCACAGCTG 
CAGAGGGGAT CGTGGTGAAT TTCAATGAAA ATCCATTTGA ATCTTGAGGT TCAGATCTTA 
^GCAAA GGACATGAGA GAAGTAATAT TGTTGCTTGA AATTTCATTG CTTATATCTA 
AAAGAAACTC CTATTTTTAA GAGAAATGTT GAATCTTTGC AACGTGGTAG ACGCTCCCAC 
AAAACTTTCT CCTGAAATAG GAGATAAATG TTGGAAAGAG GCRATGTATT GAGTATGCTG 
ATAGAGGTGG AGTTCAGAGA CAGGCAAGCA TACATAAGAG TCAGGATGTT TTTCAGTATT 
ATCTTTACAA ATGAGTTTCT TACAGTGGTC AATGACAAAC CAATTTCATT CAAAGCTTGC 
TTCAATAGGC AATGGTTTGA TGCCAATATG TTAGCTATTT ACTTTGACCA CCGTATGCAT 
TAMAAGAAG AAAAATTAAG AATACTCAAG CAGAACCTCC AACTTAGATA GCACTTTCCA 
CMAAAGTAA TOGAGGGATA GACTGAAGTT AAATGGGATC AGGTATGTGA TGAGATCTCA 
GRAGIGTTTG CACAATAATG CAOATACTCA TTTTAAACAG AGTCATAAGG ATTQGAACTA 
ATAAAAATAA TAGAATAAAA TACCGATCAA GAATGIGTCC TCCTGCGTAT CTGGGTTTTG 
AATTCTGGCI CCACAGAACT TGTCAGATAT ATGACRTTAA AC 



ItGTGTAGTC GGGTATGAGA ACGATTGCAA GCAAAAGCAG CTGAGGGACG CTGTCCAAGG 
AGTAGGCTCA GTAGCACGAA CTGCAAATAC TTGAGAAGGG AAAAGTTCAG CCTTGTGCTG 
GAAAGTGGTT TAGCATGCTG GAGCTGGTTT TCTGCTTGGT AGCCCTACAA CTTTGGCCCA 
urTACTTGGC CTCTGGGACT CAGATTCCTC CTCTTTAAAA TGGTGCTAAT AATAGCACCC 
75 AKCTCTGTG AGGAGGATGC TGTGAGGACA AAATGAGATC ATCCACATAA GCCGTGAACC 
CTGTTCCTGG TAAGGAAATG AATAACATGA AGTGAAGTCT TCATCTCCAT TCCCAACAGT 
CCCCATTCTA CTTGCAGAAA GGTTGCTTAC ACTGAAAATC AGTTTATTTT CCCCTGGTGC 
AAAGAACAGT CGTTTCTCCA AAACTGAAGC TGGAAATTAT CTOAAATATC AGGTCCTCCG 
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GAAAAGGGAC GTGAAGCCCC CTTTGTAATT TCTGCATTAG CGTGCTCTCC TGGCABGCAG 540 

GAAACCTCAT CAGAGAAGTC AGCCAAGGAA AGTCTTTAAA TGGAAATTGT GCAAAOGAGG 600 

AGCAAATGCA TTAAAAAGTT GCTGACGGGC ATGAAATGCT TTGATGTGAA GACGGftAAAC 660 

TCCAAGCAGG AAGGATTTTA ACATTTTGAA TCTGATTGAC TCTGTGGTTT CTCAGCACAG 720 

TTATTCCATG GGCTAAAATA AATGCAGAAA TGGTACTTTC AGACCACAGC TGCAGAGGGG 7B0 

ATCGTGGTGA ATTTCAATGA AAATCCATTT GAATCTTGAG GTTCAQATCT TAAAAAAGCA 840 

AAGGACATGA GAGAAGTAAT ATTGTTGCTT 3AAATTTCAT TGCTTATATC TAAAAGTIAAC 900 

TCCTATTTTT AAGAGAAATG TTGAATCTTT GCAACGTGGT AGACGCTCCC ACAAAACTTT 960 

T AGGAGATAAA TGTTGGAAAG AOGCAATGTA TTGAGTATGC TGATAGAGGT 1020 

A GACAGGCAAG CATACATAAG AGTCAGGATG TTTTTCAGTA TTATCTTTAC 1080 

^CAGTGG TCAATGACAA ACCAATTTCA TTCAA.^GCTT GCTTCAATAG 114 0 

T GATGCCAATA TGTTAGCTAT TTACTTTGAC CACCGTATGC ATTAAAAAGA 1200 

^TACTCA AGCAGAACCT CCAACTTAGA TAGCACTTTC CACAAAAACT 12 60 

AATGGAGGGA TAGACTGAAG TTAAATGGGA TCAGGTATGT GATGAGATCT CAGMGTGTT 13 2 0 

^ TGCAGATACT CATTTTAAAC AGAGTCATAA GGATTGGAAC TAATAAAAAT 1380 

\ AATACCGATC AAGAATGTGT CCTCCTGCGT ATCTGGGTTT TGAATTCTGG 1440 
^ CTTGTCAGAT ATATGACATT AAAC 
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25 CGACTCCTTA GAGCATGGCA TGGCTCAGAG GTGCTGGIAA AACTGATGSG GGTTTTTGCT ( 

GTCCCTCCCC TCRGCGCCBA CRCCATGTGG ATCCAGGTTC GGACCATGGA CGGGAGGCAG 
ACCCRCACGG TGGACTOGCT GTCCAGGCTG ACCAAGGTGG AGGAGCTGAG GCGGAAGATC 
CAGGAGCTQT TCCACGTOGA GCCAGGCCTG CAGAGGCTGT TCTACAGGGG CAAACAGATG 
GAGGACGGCC ATACCCICTT CGACTACGAG GTCCGCCTGA ATGACACCAT CCAGCTCCTG 
30 GTCCGCCAGA GCCTOSIGCT CCCCCACAGC ACCAAGGAGC GGGACTCCGA GCTCTCCGAC 

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AGTCCTCCaC CCACGGOSAG • 
GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA TGTGGGATGA GACGGAATTG 
GGGCTGTACA AGGTCRATGA GTACGTCGAT GCTCGGGACA CGAACATGGG GGCGTGGTTT 
GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC GGGACGAGCC CTGCAGCTCC 
35 ACGTCCAGGC CGGCGCIGGA GGAGGACGTC ATTTACCACG TGAAATACGA CGACTACCCG 
GAGAACGGCG TGGTCCAGAT GAACTCCAQG GACGTCCGAG CGOSCGCCCG CaCCATCATC 
AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA ACTACAACCC OSACAACCCC 
AAGGAGCGOG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA AGCGCGAGAC CAGGACGGCG 
CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC TGAACGACTG TCGGATCATC 
40 TTCGTGGACO AAGTCTTCAA GATTGAGCGG CCGGGK3AAQ GGAGCCCCAT GGTTGACAAC 
CCCATGAOAC GGAAGAGCGG GCCGTCCTGC AAGCACTGCA AGGACGACGT GAACAGACTC 
TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGO ACCCCGACAA GCAGCTCATG 
TOCGATGAGT GCGACATGGC CTTCCACATC TACTGCCTGG ACCCGCCCCT CAGCAGTGTT 
CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCRGCGA GOTOGTACTG 
45 GCGGGAGAGC GGCIGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTOSGC CACATCGTOC 
TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 
GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 
CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 
AGCAACGACG GAGCGTACIC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 
50 GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 
GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 
TTTGCTCCCfl TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 
GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 
AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 
55 TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCIGGCCC TTGGACGAAG 
GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 
GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 
CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGOS GAAGTCGGCA 
GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 
60 CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 
CTGTGGAATG AOGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 
TTGTTCCTOA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 
CGGCCCATCA CGACCGTCTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 
CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 
65 CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 
CGOTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTG.V. A^CGTGTCGG AGGGCICGTT 
CATCGGCACT GATTTTOTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT ICCTCCGTTC 
CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 
GAATTTATGT ATTCIGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 
70 CATAAAAGCC TGCAATTTCI CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 
ACIACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCIGCCAC 
CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 
AGCAAGCATC TTCCTGACaG CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 
TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 
^ « — ,nn*>n/^BK& rr-TTTnrPTr ^Bltfim'TY'r CCCACCAGAC 



TGGCCCGTGG CAGCCCliTBG ljATUUi^i>a>a — 
AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGOCATCC 
TGCTTAGCGT CTGAGATOCG CGTGAAAAGT CCTCTGCCCA OSAGAGCAGG GAGTTGGGGC 
CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCaOSTGGG GCCAGGCGTG TGACTGACGC 
TCTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 
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GRTTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAftAT 33 00 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

5 TTTTTTTTGT AGTTACTGTA TATGTACCM GAAAGATATA AOSTTAGGGI TTGGTTQTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAfiGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3560 

GAGCAATGTT AnTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 

10 TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 3828 



I I I I I I 

MWIQVRTMDG RQTHTVDSLS RLTKVEEIjRR KIQELFHVEP GLQRLFYRGK QMEDGHTLPD 
YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAABTDSR 
PADEDMWDET ELGLYKVNEY VDAEDTNMGA WPEAQWRVT RKAPSRDEPC SSTSRPALBB 

20 DVIYHVKYDD YPENGWQMN SRDVRARART IIKWQDLEVG QWMLNYNPD NPKERGFWYD 
AEISRKRBTR TARELYANW LGDDSLNDCR IIFVDEVFKI ERPGEGSPM^/ DNPMRRKSGP 
SCKHCKDDVN RLCRVCACHL CGGRQDPDKO LMCDECDMAF HIYCLDPPLS SVPSEDEWYC 
PECRNDASEV VLAGERLRES KKKAKMASAT SSSORDWGKG MACVGRTKEC TIVPSNBYGP 
IPGIPVGTMW RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VIAGGYEDDV DHGNFFTYTG 

25 SGGRDLSGNK RTABQSCDQK LTNTNRALAL HCFAPIHDQE GAEAKDWRSG KPVRWRNVK 
GGKNSKYAPA EGNRYDGIYK WKYWPBKGK SGPLVWRYLL RRDDDEPGPW TKEGKDRIKK 
LGLTMQYPEG YLEALAHRER EKENSKREBE EQQBGGFASP RTGKGKWKRK SAGGGPSRAG 
SPRRTSKKTK VBPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQI.FLSKVBB 
TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SPRAQVPSCP ACSIYDLGRSY AMQVNQPLQT 

30 VLNQLPPGYG NOR 



Seq ID NO: 28 DMA Sequence 

Nucleic Acid Accession #= NM_000756.1 

Coding sequence :186 . .776 

AGAAACTCAG LaCCAAGTC CATTOAGAGA CIGAGGGGAA AGAGAGGAGA GAAAGAAAAA 60 

GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC OSGAGACGTC 120 

TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGGGAAG OSCCTGGGAA GOBAGTGCCC 180 

CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGABICCT GCTGGTGGCT CTCCTGCCCT 24 0 

GCCCGCCATG CAGGOCGCTC CTGAGCCGOS GGCOMTCCC GGQAGCTCGG CAGGCGCCGC 3 00 

AGCACCCTCA GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCOSAGCAG CCCCAGCAGC 360 

CgSgGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 420 

TCAACAAGAG CCCGGCCGCT CCCCTTTCGC CCGtKTCCTC GCTCCTOSCC GGAGGCAOCG 480 

G^C^CCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG CTGCAGCAGC E40 

?GCTGCTGCC TCGGCGCTCG CTCGACAGCC CCGCGGCTCI CGCGGAGCGC GGCSCTAGGA 600 

ATGCCCTCGG CGGCCACCAG GAGGCACOSG AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 660 

TCTCCCTGGA TCTCACCTTC CACCTCCTCC GGGAAGTCTT GGAAATGGCC AGGGCCGRGC 720 

AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 780 

GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840 

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GPAAO^ 900 

GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 960 

GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 1020 

TGCTG™ AAGAGAATGT AGACCTGTGA GAAAACGTIT TGTTTGAAAA AGCAGACAGA 1080 

Ig?cAC?cL TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 1140 

TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 1200 

ACATCTTTCC AGTCCTTGGT CAGTTTGGIT GIX3TAAGAGA ATGTTGAATA CTTATATTTT 1260 
TAATAAAAGT TGCAAAGGT 
Seq ID NO! 29 Protein Sequence 



MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQPQQPQ 
ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS HPSPEQATAH FFRVLLQQLL 
LPRRSLDSPA ALAERGARMA LGGHQEAPER ERRSEEPPIS LDLTPHLLRE VLEMAHABQL 
AQQAHSNRKL MEIIGK 



ItTTCTTGAG ATGGCTCCTC TTGGRACTAT GCATCTCTCC TGATACTTGG ItGCTTTTCC 60 
TCTGACTGAT GAAGATCCTG AATACCAAAG AGGGCOSCTG AAAGGGCTCT AGGAGTACAC 120 
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CTTCTAGGAA CCCTAAGCCA GAGAGAGGCT TCACTACATC ATGCTTCCTG ACATCTCTCC 
CTTTGAAGAG CAGTCAGACT CCTGCTTTGC TCTTmOACT TAATTTGGGG GTTTAACAGG 
TGAGGTTGCT GGGGGARCTC TTTTACAACA TCTCTCTGAA AGAATCCGGG CTGCCAQTTT 
CATTTGGTTT GGGTGTCAGT AGCATGATGG AAAGACAAAA AAACACftACT TGACATCTGC 
AGAAATGGGT TCAAATTTTA CCTGCAACTC ACCAATTCTG TGGCCTTGGT TCAGCAATTA 
AACTCCCTAA AftTTCAGTTT TTTCTTTGTA AAATGGGGTT ATGRACAGTA CCTACTTCAA 
AATGTGTTTG TGAAGATTAA AAAAGTTABC ATAAAGAGTT TAGAAGASTG TCTGGCATAT 
TGTGCTCAAT AAGTGTTTAT TTATTTATTG CTGAATAAAC CAGTAATITA ATTAGTAT 



I I I I I I 

CACCTCAGCC CCAACCCCAA CCCCAGCCGC ATCCCCTGCC CCAGCTGACG GGTCAAAGCC < 
TCAGGAGAGT GTGGCTCTCC CCAGGCGCTA CCAGGAGGGG CAGGTCTCAG CCAGCTGGGG 
AflACCTTATT GCCATGGTTC TTAGAAGCCA CCCCTTCCCC AGGCAAGACA GGCCCCAAGG 
GAGTGTCCCG AGGGCGGTTC CCGGGAGCCC CGTGGGTCCC AGCACTTCCA CACACTCTGA : 
GGACAGACAC GGCCCCTCTT CTTCAGTGGG GACAGTCATA GGGACAGGTA CAGGGGGCCT 
GGTTGAGGCT GGAGGTCAGC CACAGCCAAQ AAGCTCCGAG ACCAACGGAT CGCCCAGCCC 
AGACCCTCCC CCAGGCCTAA GAGGAGAGGG AACCAGGGAG AAAAGICTAG ACCCGCTGCC 
CCAAGCCGCG ATGCCCAGGG GCCCCGCACR GCCCCCCGOS CAGAGGCCGC CTGGCCCCGC 
GGCCTCCTCC TCTGCGAGGC GCTCACAGCC GGTACCCCAG CTACGGAAAC GCAGCAGGTG 
CGAAATCGCC COaGCTCGG AGCAGGAGGT CAGGCCGGCC GCCTCGGGGG ACCCTCAAGG 
GGAGGCGCOS GGGGRGGGGG GCAGCXCTGC CGGCCGCAGC GGGGCGCTCA OSGRAAAGCA 
GGAGGAGGCC aSGAAGCTCA TGGTGTTTCT GCAGAGGCCC GGGGGTTGGG GGGTGGTGGA 
GGGGCCCCGG AAGCCCAGCT CCOSGGCCCT GGAGCCCGCC ACGGCGGCAG CCCIGCGGCG 
GCGGCTGGRC CTGGGCAGTT GCCTGGACGT GCTGGCCTTT GCCCAGCAGC AOSGAGAGCC 
CGGCCTGGCG CAGGAGACCT ftCGOSCTGAT GAGCGACAAC CTGCTGCGAG TGCTGGGAGA 
CCCGTGCCTC TACCGCCGGC TGAGCGCGGC CGACOSCGAG CGCftTCCTCA GCCTGCGGAC 
CGGCCGGGGC OSGGCGGTGC TGGGCGTCCT OSTACTGCCC AGCCTCTACC AGGGGGGCCG 
CTCAGGGCTC CCEAGGGGCC CTaSTGGOSA GGAGCCTCCT GCGGCGGCCC CTGTGTCCCT 
GCCTCTACCT GCGCACCTGC ATGTOTTCAA CCCCOGGGAG AACACCTGGC GGCCCCTGAC 
CCAGGTGCCC GAGGAGGCCC CGCTTCGGGG CTGCGGTCTC TGCACCATGC ACftACTACCT 
GTTTCTGGCG GGGGGCATCC GTGGCTCCGG TGCCRAGGCC GTCTGCTCCA AOSAGGTCTT 
CTGCTACAAC CCTCTGACCA ACATCTGGAG CCAGGTTCGG CCCATGCAGC AGGCCCX3AGC 
CCAGCTCAAG CTGQTGGCCC TGGACGGGCT GCTCTATGCC ATCGGTGGCG AATGCCTGTA 
CAGCATGGAG TGCTACGACC CGOSAACAGA CGCCTGGACC CCRCGCGCGC CACTCCCC3GC 
AGGCACCTTC CCTGTGGCCC ACGAGGCTGT GGCCIGCCGT GGGGACATCT ACGTCACCGG 
GGGTCACCTC TTCTACCGCC TGCTCAGGTA CAGCCCCGTG AAGGATGCTT GGGAOSAGTG 
CCCATACAGT GCCAGCCACC GQCGTTCCAG CBACATCGIG GCACTGGGGG GCTTCCTGTA 
CCGCTTCGAC CTGCTGCGGG GCGTGGGCGC CGCCGTGATG OGCTACRACA CAGTGACCGG 
CTCCTGGAGC AGGGCTGCCT CCCTGCCCCT GCCCGCCCCC GCCCCACTGC ACTGCACCAC 
CCTGGGCAAC ACCATTTACT GCCTCAACCC CCAGGTCACT GCCACCTTCA CGGTCTCTGG 
GGGGACTGCC CAGTTCCAGG CCAAGGAGCT GCAGCCCTTC CCCTTGGGGA GCACCGGGGT 
CCTCACTCCA TTCATCCTGA CTCTGCCCCC TGAGGACCGG CTGCAGACCT CRCTCTGAGT 
GGCAGGCAGA GAACCAAAGC TGCTTCGCTG CTCTCCAGGG AGACCCTCCT GGGATGGGCC 
TGAGAGGCCG GGGCTCAGGG AAGGGGCTGG GATCGGAACT TCCTGCTCTT GTTTCTGGAC 
AACTTTCCCC TTCTGCTTIA AAGGTTGTCG ATTATTTIGA AGCCCAGACT CCCTCAOCCT 
CTTTCTGCCC CTCACTCCAC ACCCAGACTG TTTCCTGACT CAATTCCGTA CCTACTTACA 
GAC-CTCTCA GCTTGCTGAC ACCCCCCTGT CTGTGGGACT CCCTATTCCC TAGAGCCAGG 
GACTCATGCG TCTCCACAGA CRAGGACTTG GCTCGCTGGA GCTCTGCTGA GCCGAGAGAG 
GAGGGGGTAG AAAftCATTCA CACTTCCTAT GCICTGTCAG CAGGACAGGG AGCAAAAACG 
TCCCCAGGCA ACGCCCTCGC CTGTGGGACT TTCTGCCTGT CCTAAGGCCI CCCCAGGTAC 
CAACCCCGTA GCTATCTGGG TCTGITTGGC ACTGTGGATT CTCAflGGGCC TAGAACCCTT 
GCCTCTGAAA CTGGTCCGCT GGTGCAGCCC TGCTGTCTGC AGCTCCTGCC CATACCCCCA 
GCCCACACCA GGCCAGGCCC ACTCCGGGCT CACCACCCTC TGCAGCCTTG TGGGGCTCTC 
CCAGCKCCIC CAGAAGCCCA CCCCACTTCT CGCCAACCCC CGATCTCTAA ATGAGGCCTG 
AGOBTCACCC TAGTTCTGCC CCTTTTTAGC TGTGTAGACT TGGACGAGAC ATTTGACTTC 
CCTTTCTCCT TGTCTATAAA ATGTGGACAG IGGACGTCTG TCACCCAAGA GAGTTGTGGG 
AGACAAGATC ACAGCTATGA GCACCTCGCA CGGTGTCCAG GATGCACAGC ACAATCCATG 
MGCGTTTTC TCCCCTTACG CACTTTGAAA CCCATGCTAG AAAAGTGAAT ACATCTGACT 
GTGCTCCRCT CCAACCTCCR GCCTGGATGT CCCTGTCTGG GCCCTTTTTC TGTTTTTTAI 
TCTATGTTCA GCACCACTGG CACCS^TAC ATTTTAATTC ACCGAAAGCA 
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MPRGPAQPPA QRPSGPAASS SARRSQPVPQ LRKESRCEIA PSSEQEVRPA ASGDPQGEAP 
G^GSPAGRS GALTBKQEEA RKLMVFLQRP GGWGWEGPR KPSSRALEPA TAAALRRRLD 



GEGGSPAGRS GALTBKQBBA KAliravri.uKi' cuimij vv iiui-k j^jrooivr>""x„ — 

LGSCLDVLAF AQQHGEPGLA QETYALMSDN LLHVLGDPCL VERLSAADRE RILSLRTGRG 
RAVLGVLVLP aVQGGRSGL PRGPRGEEPP AAAPVSLPLP AHLHVFNPRE NTWRPLTQVP 
EEAPLRGCGb CTMHNYI.FLA GGIRGSGAKA VCSNEVFCYN PLTNIWSQVR PMQQAEAQLK 
LVALDGLLYA IGGECLYSME CYDPRTDAWT PHAPLPAGTF PVAHEAVACR GDtYVTGGHL 
FYRLLRYSPV KDAWDECPYS ftSHRRSSDlV ALGGFLYRPD LLRGVGAAVM pvMTn;T.r:™= 
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I 1 1 

3 RAAARACCOS TTTCCa^GA AGGCCTCTftA AAGGCCAAGA 

3 CAGACTTTAC AAACAGAGTG TTTCCTAACT GCTCTATGftA CAGAAAGGTT 
AAACTCTGTQ AGTTGAACCA ACACATCACR ACGCAGTTTG TGGGAATGRT TCTGTCTAGT 
A AGATATTTCC TTTTCTGCCG TTGACCTTAA AGCGCTTGAA ATCTACaCTT 
A CAAATAGAGT GTTTCAAATC TGCTCTGTCT AAGGQAAOST TCAACTCTGT 
C AAGGAAGTTA CTGGGAATTC TTCTGTCTAG CCTTACSiTGA 
C GTTTCCAACG AAGGCCTCTA AGTGGTCAAA ATATCCACGT GCAGACTTTA 

T GTTTCCAAAC TGCTGAATGA AAAGAAAAGT AAACTCTGAG AGTTGAACGC 

ACACATCGCA GAGCAGTTTC TGAGAATGAT TCTGTCTAGT TTTTATACGA AGATATTTCC 
TTTTCTGCCT TTGGCCTCAA AGCGCTTGAA ATCTCCACTT GCAAATTCCA CAAAAAGAGT 
GTTTCAAATC TGCTCTGTCT AAAGGAAGGT TCAACTCTGT QAGTTGAATA CACACAACAC 
AAAGAAGTTA CTGAGAATTC TTCTGTCTAG CATTATATGA AGAAATCCCQ TTTCCAACGA 
A GAGGTCCAAA TATCCACTTG CAGACTTTAC AAACAGAGGG TTTCCAGAAT 
A AAAGAAAGGT TAAACTCTGT GAGTTAAACA CACA 



D NO: 34 DNA sequence 

ic Acid Accession AF0114Sa. 



GGAAGACTTG GGTCCTTGGG TCGCAGGTGG GAGCCGACGG GTGGGTAGAC CGTGGGQQAT 
ATCTCAGTGG CGGACGAQGA CGGCGGGGAC AAGGGGCGGC TGGTCGGAGT GGCGGAGCGT 
CAAGTCCCCT GTCGGTTCCT CCGTCCCTGA GTGTCCTTQG CGCTGCCTTG TGCCCGCCCA 
GCGCCTTTGC ATCCGCTCCT GGGCACCGAG GCGCCCTSTA GGATACTGCT TGTTACTTAT 
TACAGCTAGA GGCATCATQG ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC 
TACAGCTCCR GTTGGAGGTC CAAAACGTGT TCTCQTGACT CAGCAAATTC CTTGTCAGAA 
TCCATTACCT GTAAATAGTG GCCAGGCTCA GCGGGTCTTG TGTCCTTCAA ATTCTTCCCA 
GCGCGTTCCT TTQCAAGCAC AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 
GCAGAAGCAA TTGCAGGCAA CCAGTGTACC TCATCCTGTC TCCAGGCCAC TGAATAACAC 
CCAAAAGAGC AAGCAGCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC 
ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGG GCTTTGGAAG ACTTTGAAAT 
TGGTCGCCCT CTGGGTAAAG GAAAGITTGG TAATGTTTAT TTGGCAAGAG AAAAGCAAAG 
CAAGTTTATT CTGGCTCTTA AAGTGTTATT TAAAGCTCAG CTQQAQAAAG CCGGAGTGGA 
GCATCAGCTC AGAAQAQAAG TAGAAATACA GTCCCACCTT CGGCATCCTA ATATTCTTAG 
ACTGTATGGT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 
TGGAACAGTT TATAGAGAAC TTCRGAAACT TTCAAAQTTT GATGAGCAGA SAACTGCTAC 
TTATATAACA GAATTGGCAA ATGCCCTGIC TTACTGTCAT TCGAAGAGAG TTATTCATAG 
AGACATTAAG CCAGAGAACT TACTTCTTGG ATCAGCTGGA GAGCTTAAAA TTGCAGATTT 
TGGGTOGTCA GTACATGCTC CATCTTCCAG QAQQACCACT CTCTGTGGCA CCCTGGACTA 
CCTQCCCCCT GAAATGATTG AAGGTCGGAT GCATGATGAG AAGGTGQATC TCTGGAGCCT 
TGGAGTTCTT TGCTATGAAT ITTTAGTTGG GAAGCCTCCT TTTGAGGCAA ACACATACCA 
AGAGACCTAC AAAAGAATAT CACGGGTTGA ATTCACATTC CCTGACTTTG TAACRGAGGG 
AGCCAGGGAC CTCATTTCAA GACTGTTQAA GCATAATCCC AGCCAGAGGC CAATGCTCAG 
AGAAGTACTT GAACACCCCT GGATCACAGC AAATTCATCA AAACCATCAA ATTGCCAAAA 
ransnaATCA GCTAGCAAAC AGTCTTAGGA ATCGTGCAGG GGGAGAAATC CTTGAGCCAG 
GQCTGCCATA TAACCTGACA GGAACATGCT ACTGAAGTTT ATTTTACCAT TGACTGCTGC 
CCTCAATCTA GAACGCTACA CAA.GAAATAT TTGTTTTACT CftGCAG°™T 0=™=^ 
CCCTATTCAG AAAGCTCCAC ATCAATAAAC ATGACACTCT GAAGTGAAAG TAGCCACGAQ 
AATTGTGCTA CTTATACTGG TTCATAATCT GGAGGCAAGG TTCGACTGm GCOGCCCCGT 
CA3CCTGTGC TAGGCATGGT GTCTTCACAG GAG3CAAATC CAGAQCCTQG 
AGTGACCACT CTGCCCTGAC CCCGATCAGT TAAGGAGCTG TGCAATAACC TTCCTAQTAC 
CTGAGTGAGT GTGTAACTTA TTGGGTTGGC GAAGCCTGGT .^GCTGTTG GAATGAGTAT 
GTGATTCTTT TTAAGTATGA AAATAAAGAT ATATGTACAG ACTTGTATTT TTTCTCTGGT 
GGCATTCCTT TAGGAATGCT QTGTGTCTGT CCGGCACCCC GGTAGGCCTO ATTGGQTTTC 
TAGTCCTcS ?^C^CTTA TCTCCCATAT GAGAGTGTGA AAAATAGGAA CACGTGCTCT 
MCTCCATIT AGGGATTTGC TTGGGATACR GAAGAGQCCA TGTGTCTCAG AGCTGTTAAG 
^CTTATTTT TTTAAAACAT TGGAGTCATA GCATGTGTGT AAACTTTAAA TATQCAAATA 
AATAAGTATC TATGTCTAAA ft' 



Seq ID NO; 35 ^''"''^'•"j^^ljggj^j^ 

] Y r " i" " 

ioRSKENClS GPVKATAPVG GPKRVLVTQQ IPCQNPLPVN SGQAQRVLCP SNSSQRVPLQ 60 

MKLVSSHKP VQNQKQKQLQ ATSVPHPVSR PLWTQKSKQ PLPSAPENNP EEELASKQKN 120 

EESKKRQWAL E^PBIGRPLG KGKFGSVYLA REKQSKPIL* LKVLFKAQLE KAGVEHQLRR 180 

EVEIQSHLRH PNILRLYGYP HDATRVYLIL EYAPLGTWR ELQKLSKPDE QRTATYITEL 240 

ANALSYCHSK RVIHRDIKPE HLLLGSAGEL KIADFGWSVH APSSRRTTLC GTLDYLPPEM 300 
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11 21 31 41 51 

I I I I I 

3 ACAC3AATCAG GTGATGGTCC AGAATTAAGA GCTGTCACCT GTGTCATTCA 
3 AAGRAATGAA GAAGACTGCC ATCCGGCTGC CCAAAGGCAA ACAGAAGCCT 
3 AAT3GAATTC CCGGTGTGTC CTTTTCRCCT ACTTCCAAGG GGACATCAGC 

3 ATGAACACTT CTCCAGAGCT CTGAGCAATA TCAAGAGCCC CCAGGAATTG 

ACCCCCTCGA GTCAGAGTGA AGGTGTGATG CTGAAAAACG ATGATAGCAT GTCTCCARAT 
CAGTGGCGTT ACTCGTCTCC ATGGACAAAG CCRCAACCAG AAGTACCTGT CACAAACCGT 
GCCGCCAACT GCAACTTQCA TGTGCCTGGT CCCATGGCTG TGAATCAGTT C 
CTGGCTAGGA GGGCCTCTGT TCGGCCTGGG GAGCTGTGGC ATTTCTCCTC C 
ACCAGCICCT TAGAGCCTGG CTACTCTCAT CCCTTCCCIX5 CTCGGCRCCT G 
CCCCAGCCTG ATQGGAAACG TGAGCCTCTC CTAAGTCTCC TCCAGCAAGA C 
GCCCGTCCTC AGGAATCTGC CGCCAGGGAG AATGGCAACC CTGGCCAGAT A 
ACAGGGTTGC TCTTCAACCT GCCTCCCGGC TCAGTTCACT ATAAGAAACT ATATGTATCT 
CGTGGATCTG CCAGTACCAG CCTTCCAAAT GAAACTCTTT CAGAGTTAGA GACACCTGGG 
AAATACTCAC TTACACCACC AAACCACTGG GGCCACCCAC ATCGATACCT GCAGCATCTT 
TAGTCAAGTT GGAGGAGAAA GACAACACTTT GGTCTAAGAC ACGGCAGCAA GACATCCCTG 
CATATTGTTC CAGATAAAAA TGAAAGCTGC TCACACCCAC TTGCCTCCCC AATCTGTTAA 
ACAGCTTCGT GTCTAGTATG AGCTCAGTAC TTGCCCTGTG AAAATCCCaG AAGCCCCCGC 
TGTCAATGTT CCCCATCCAC ACCCTGCTTG CTCCTGTGTA ACAGCTCAGA TGATGAATAA 
A AAAAAAAAAA AAAAAA 



I I I I I I 

MEEMKKTAIR LPKGKQKPIK TEWHSRCVLF TYPQGDISSV VDEHPSRALS NIKSPQELTP 
PSSQSBGVML KNDDSMSPNQ WRYSSPWTKP QPEVPVTNKA ANCNLHVPGP MAVNQFSPSL 
AARRASVRPG ELWHPSSLAG TSSLEPGYSH PPPARHLVPE PQPDGKREPL LSLLQQDRCL 
ARRPQESAAE ENGNPGQIAG STGLLEWLPP GSVHYKKLYV SRGSASTSLP NETLSELETP 
GKVYSLTPPN H 



Lgaaaccaa ggcaagctcc ccctgtcaaa gcaccttggc ccataagaag aaaaggggga 

GCCCCAGATG TGATGAGCGC TTCC»GGCTT CRGGCTCAQA RGGCGCCCCC AGCTCTCCTG 
TAACTCAGAG GCCAGTGTGA TGGGAGTTCC TCCACTCAGC RCaCTTCCCC TGTAAACaCG 
CCTGTGGTGG GCAAAAGGQC TTTGGAACGG TTGCTTGTCT TTTCTCTCCT GCGTAATTTC 
CACTTTCATT CATGATAATG TCGAACACGC ACAAAQCTCG GCTGGAACGC CGGGTCACTG 
GCTCAACCAA CCGGTGGCGT TTGCCCAAAC AGCCTTTCTC TGGGGACCTG CTCTCACTTT 
CCCAGATGTG CAAGQCTCTG AGCATAGACT TTGAGGAAGC TTTGAGGAAC CCaGACaGGT 
TATGCATTTC ACAAATCCAG AAGTTTTTCT TTGAGAATTT CAAGAACAAG GACATCCAAA 
GTGGGGAAGC AGATGTGATT CTCGAGTGCC TGGGCTTCAA ATGGGAGCTC CaTCAGCCCC 
GGCTTTTTCA GTCTGAGACC TTGGCCAAGC TCTACCTGAA AGCCCTGGCG CAQGQCaCCA 
CACACCCCCT GAGGGAGCTG GAGGAGCTTC TGCGAGCTCA ATCACCTAAG AAGACCAAAG 
AAAAATCCCC TGCAAAGAGG ATCATCATTT CCTTGAAGAT CAATGACCCA =TGGTCACTA 
AAGTCGCCTT CGCCACGGCC CTGAAGAP.CC TCTACATGAG TGAGGTGGAG ATTAACTTGG 
AAGACCTACT GGGAGTGCTG GCTTCCGCCC ACATCCTCCA GTTCAGTGGC CTGTTTCAAA 
GGTGCGTGGA TGTGATGATA GCCAGACTCA AGCCAAGCAC CATCAAGAAA TTCTACGAGG 
CCGGCTGCAA GTACAAGGAA GAGCAGCTCA CCACCGGCTG CGAGAAGTGG CTGGAAATGA 
ACTTGGTTCC TCTAGQGGGG ACGCAGATCC ACCTCCACAA AATCCCACAG GACCTGCTCC 
ACAAAGTSCT GAAGTCCCCC AGGTTATTTA CCTTTAGTGA ATTCCATCTT CTGAAAACAA 
TGCITTTGTG GGTCTTCTTG CAA.CTGAACT ACAAGA7TCA GGCAATTCCG ACTTATGAAA 
CCQTGATGAC ATTTTTTAAG AGCTTTCCTG AGAACTGTTG CTTTCTGGAC CGGGACATAG 
GACGGAGCTT GAGGCCGCTC TTCCTCTGCT TGCGTCTGCA CGGCATCACC AAAGGCAAGG 
ATCTCGAGGT GCTGCGGCAC CTTAACTTCT TCCCAGAGTC ATQGCTCGAC CAGGTTACAG 
TCAACCATTA CCACGCACTG GAGAATGGGG GCGACATGGT CC?.CCT=AAA GATCTTAACA 
CCCAGGCTGT GAGATTTGGG CTGCTCTTTA ACCA.GGAG^A TRCAACTTAT TCGAftAACGA 
TTGCTCTATA TGGATTCTTC TTTAAGATAA AG3GACTCAA ACATGATACT ACCICTTATA 
GTTTTTACAT GCA3AGAATA AAGCACACAG ACCTGGA;>.TC TCCCTCTGCG GTCTACGAGC 
ACAACCACGT CAGCCTGCGA GCGGCACGCC TGGTGAAGTA TC-AGATCAGA GCAGAGGCCC 

tggttgacgg caagtggcag gagttcagga caaaccagat caagcagaag tttgggttga 

CCACGTCATC CTGCAAAAGC CATACCTTGA AAATCCAAAC TGTGGGCATC CCAATCTATG 
TAAGTTTTGC ATTCATCTTC CCAGCATCTT GRCAGTTrCC AGAAGAATCT ATGGGATTTT 
CCCCCCRCTG GTCTGCATAA AAGAAAATAA AATGACRTAA AAGQQRGC 
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I I I I I 

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDLLSLSQMC KALSIDFEEA L 
QIQKFPPENP KNKDIQSGEA DVILECLGFK WELHQPRLFQ SETLAKLYLK ALAQGTTHPL 
RELEELLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAP ATALKNLYMS EVEINLEDLL 
GVLASAHILQ PSGLPaRCVD VMIRRLKPST IKKFYEAGCK YKEEOLTIGC EKWLEMNLVP 
LGGTQIHLHK IPQDLLHKVL KSPRLPTFSE FHLLKTMLLW VFLQLNYKIQ RIPTYBTVMT 
PFKSFPENCC FLDRDIGRSL RPLFLCLRLH SITKGKDLEV LRHLNFFPES WLDQVTVNHY 
HALENGGDMV HLKDLNTQAV RFGLLFNQEN TTYSKTIALY GFFFKIKQLK HDTTSYSFYM 
QRIKHTDLES PSAVYEHNHV SLRAARLVKY EIRAEALVDG KWQEPRTNQl KQKFGLTTSS 
CKSHTIiKIQT VGIPIYVSFR FIFPAS 



Ssq ID NO: 40 DNA S 

Nucleic Acid Accession #i NM_032899.1 
Coding sequence: 186.. 1070 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGR TTGCTCAGGA CAGCGGTAAA TCACTTCTTG GAGGTGCCCT GCACGCOGGT f 
CCTGGGAGC» GGQGGCCTCC CGGGQGTGCG GGAGCCCCAC TCCTCCGTGG TGTGTTCCAT 
TTGCTTCCCa CATCTGGAGG AGCTGACGTG CCAGCCTCCC CCAGCACCAC CCAGGGACGG 
GAGQCATGAG CCQGTCAfiGG CRCCTGGGCR AAATCCGGAA GOSTCTGGftA GATGTCAAGA : 
GCCAGTGGGT CCGGCCAGCC AGGGCTGACT TTAQTGACAA CGAGRGTGCX: CGGCTGGCCA 
CGGACGCCCr CTTCGATGGG GGTTCTGAAG CCTACTGGCG GGTGCTCAGC CAGGAAGGCG 
AGGTGGACTT CTTGTCCTCG GTGGAGGCCC AGTACATCCA GGCCCAGGCC AGGGAGCCCC 
CGTGTCCCCC AGRCACCCTG GGAGGGGCGG AAGCAGGCCC TAAGGGACTG GACTCCAGCT ■ 
CCCTACa^TC CGGCACCIAC TTCCCTGTGG CCTCAGAGGG CAGCGAGCXB GCKCTACTGC I 
ACAGCTGQGC CTCAGCTGAG AAGCCXTACC TGftAGGAAAA ATCCAGCGCC ACTGTGTACT ( 
TCCAGACCGT t»AGCACAAC AACATCAGAG ACKTCGTCCG CCGCTGCATC ACCCGGACTA 
GCCAGAACAT ttcx:atccgg AGTGTGGAAG GAGAGATATA CTGTGCCAAG TCSGGCRGGA 
AATTCACTGG CCRAATCCGG GAGAAGTTCA TCATCTCGGA CTGGAGATTT GTCCTGTCTG 
GATCTTACAG CTTCACCTGG CTCTGCGGAC ACGTGCACCG GAACATCCTC TCCAAGTTCA 
CAGGCCAGGC GGTGGAGCl'G TTTGACGAGG AGTTCCGCC» CCTCTACBCC TCCTCCaAGC 

ctgtgatggg cctgaagtcc ccgcggctgg tcgcccccgt ccxbccogga gcaqckccgq 

CCA^TGGCCG CCTTAGCAGC AGCAGTGGCT CCGCCAGTGA CCGCACGTCC TCCAACCCCT 
TCAGCGGCCG CTCGGCAGGC AGCCACCCCG GTACCCGAAC TGACGGCIGA gatgaggtta 
GAATGACTGG GCCCGGCTGA ACATTCCAAA TTGGATTTCA CCaTCTGCTG AQAAAGTTTA 

aggaaggcaa agcttgccag gtcacagaag ctcccaagcc cagctttcca aaggcctcag 

CCTGTGCCTG TGTCGAGCTC AGTCCTGGGA GATAGGGGAG AACCTGCAGG CAGGAACAAG 
CCCCCCTACT CCTGACCACC CTCCATCAGC AGTCTCCCCT CCQTGQTCQT CTTTGTTGAC 
AAAGGTGCAG TTTCTCCTCT CCTGGGCACC TGTAACATGT GATGCGCTGC CTGCIGGGAQ 
GTTAGGTCGG GGCTOCCCCG GCGAGTGQAQ CATGAGCAGA ACCGCCGAGG GTCACTTCTG 
GGCAGAAGCT TTGRGAGCCI GGGTCCAGGT TGCCACATAQ AAQCAGCTCT CCAGTTCAAA 
CCCTCCTCTQ CCAGCCTGGG GTCCTAAGCG ATGAGCAGAA TCCCCCACTC CCACKCCACC 
AACCCACAAT GGATATGTAG TQAQCAAGAA ATAAACCTTT GTTGnTAAA AAAAAAAAAA 
AAAAAAAA 

seq ID NO: 41 Protein Sequence 
Protein Accession tt. NP_116288.1 

21 31 4X 51 

MSRSRHLGKI RKRLEDVKSQ WVRPARADFS DNESARLATD ALLDGGSBAY WRVLSQEGEV 
DFLSSVEAQY IQAQAREPPC PPDTLGGAEA GPKGLDSSSL QSGTYFPVAS EGSEPALLHS 
MASREKPYLK EKSSATVYFQ TVKHNNIRDL VRRCITRTSQ NISIRSVEGE lYCAKSGRKF 
TQQIREKFII SDWRFVLSGS YSPTWLCGHV HRHILSKFTQ QAVELPDEEP RHLYASSKPV 
MGLKSPRLVA PVPPQAAPAN GRLSSSSGSA SDRTSSNPFS GRSAGSHPGT RTDG 

Seq ID NO: 42 DNA sequence 

Nucleic Acid Accession #: NM_000782.2 

Coding sequence: 405.. 1946 

1 11 21 31 41 51 

1 1 1 1 I I 

TGGAGAGGGA CAGGAGGAAA CGCAGCGCCA GCAGCATCTC ATCTACCCTC CTTGACACCT 
CCCCGTOQCT CCAGCAAGCC CTAGAGGTCA GCCTTGCGGA CCAACAGGAG GACTCCCAGC 
TITCCCTTTT CAAGAGGTAC CCCAGACACC GGCCACCCTC TTCCAGCCCC TGCGGCCAGT 
BCAAGGAGGC ACCRATGCTC TGAGGCTGTC QCQTQQTGCA GCGTCGAGCA TCCTCGCCGA 
GICCTTCTQC TCCCTGTCCC GCCTCACCCC GCTCCATCAC ACCAGCTGGC CCTCITTGCT 
TCCTTTTCCC AGAATCGTTA AGCCCCGACT CCCftCTAGCA CCTOSTACCR ACCTCGCCCC 
ACCCCATCCT CCTGCCTTCC CGCGCTCCQ3 TQTCCCCCGC TGCCATGAGC TCCCCCATCA 
GCAAQAGCCG CTCGCTTGCC GCCTTCCTGC AGCAGCTGCG CAQTCCGAGG CAGCCCCCGA 
GACTGGTGAC ATCTACGGCQ TACACGTCCC CTCAGCCGCG AGAGSTGCm GTCTGCCCQC 
TGACAGCTGG TGGCGAGACT CAGAACQCQG CCQCCCTGCC GGGCCCCACC AGCTGGCCAC 
TGCTGGCGAG CCIGCTOCAG ATTCTCTGGA AAGGGGGTCT CAAGAAACAG CAOGACACCC 
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TGGTGGAGTA CCACAAGARG TATGGCAAGA TTTTCCGCAT GAAGTTGOQT TCCTTTGAGT 
CGGTGCACCT GGGCTCGCCA TGCCTGCTGG AAGCGCTGTR CCGCACCGAG AGCGTACCCC 
AGCGGCTGGA GATCAAACCG TGGAAGGCCT ATCGOGACTR CCGCAAAGAA CSSCTACGGGC 
TGCTGATCCT GGAAGGGGAA GACTGGCAGC aOGTCCQQAG TGCCTTTCAA ARGAAACTAR 
TGAAACCAGG GGAAGTGRTG AAGCTGGACA ACAARRTCAA TGAGGTCTTG QCCGATTTTA 
TGGGCAGAAT AGATGAGCTC TGTGATGAAA GAGGCCACGT CGAAGACTTG TACAGCGAAC 
TGAACAAATG GTCGTTTGAA AGTATCTGCC TCGTGTTGTA TGAGAAGAGA TTTGGGCTTC 
TCCAGAAGAA TGCAGGGGAT GAAGCTGTGA ACTTCATCAT GGCCATCSiAA ACAATGATGA 
GCACGTTTGG GAGGATGATG GTCACTCCAG TCGRGCTGCR CAAGAGCCTC AACACCAAGG 
TCTGGCAGGG ACACACTCTG GCCTGGGACA CCRTTTTCAA ATCAGTCAAA GCTTGTATCG 
T AGAGAAGTAT TCTCAGCAGC CTAGTGCAGA TTTCCTTTGT GACATTTATC 
G GCTTTCAAAG AAAGAATTGT ATGCTGCTGT CACAGRGCTC CAGCTGGCTG 
GACAGCAAAC AGTCTAATGT GGATTCTCTA CAATTTATCC CGTAATCCCC 
3CTTCTT AAGGAAATTC AAAGTGTATT ACCTGAGAAT CAGAGGCCaC 
h TTTGAGGAAT ATGCCGTATT TA-OAAGCCTG TCTGAAAGAA TCTATGAQGC 



C CAAAGGAACA GTGCTCATGC TAAATACCCA GGTGTTGGGA TCCAGTGAAG 
„^~..,..„A AGATTCAAGT CAGTTTAGAC CTGAACGTTG GCTTCAGGAG AAGGAAAAAA 
TTAATCCTTT TGCGCATCTT CCATTTGGCG TTGGAAAAAG AJ.TGTGCATT GGTCGCCGAT 
T TCAACTGCAT TTGGCTCTTT GTTGGATTGT CCGCAAATAC GACATCCflGG 
^ TGAGCCTGTT GAGATGCTAC ACTCAGGCAC CCTGGTGCCC AGCCGGGAAC 
TTGC TAACATCATA 
3 GTTCACTGCT 



TCACCTGTGT CAGCACCRTC TTCAGGTGCT TAGAATGGCC TG3GAG 
GGAGGCTGGC ACTTGTCAGT CAGGTA 



TTACAAACCG TTTCAGGCCC TGCTACCACA TTCACTGTTT GAATCTTTAA 
TAAGTTTACA TTTCACAATG AATGACCIAC AACAGCTAAA TTTTCTGGGG CTGGGAGTAA 
TACTGACAAT CCATTTACTG TAGCTCTGCT TAATGTACTA CTTAGGAAAA. T3TCCCTGCT 
TAATAATGTA AGCCAAGCTA AATGATGGTT AAAGTTATCA GGCCTCCCAT G.=AATTGCGT 
TCTTCCTGCA TTGAAATAAA AACATTATTG GGAAACTAGA GAACACCTCT ATTTTTAAAA 
GGACTTTAAC GAAGTCARAC ARCTTCTAAG ACTAGTGATT CACTGGGGCA TTATTTGTTA 
GAGGACCTTA AAATTGTTTA TTTTTTAAAT GTGATTCCTT TATGGCATTA GGGTAAAGAT 
GAAGCARTAA TTTTTAAATT GTGTAIGTGC RTATGAAGCA CAGACATGCA TGTGTGTGTG 
TGTCTGTGTG TGTGTGTCCG TGTATOTGTG TGTGGGTTCT AATGGTAATT TGCCTCAGTC 
ATTTTTTTAA TATTTGCRGT RCTTGRTTTA GGATCTGTGG TGCRGGGCAT GTTTCAAAGT 
TTAGTCACAG CTTRAAAACA TTCAGTGTGA CTTTAATATT ATAAAATGAT TTCCCATGCC 
ATAATTTTTC TGTCTATTAA RTGGGRCAAG TGTARAGCAT GCAAAAGTTA GAGATCTGTT 
ATATAACATT TGTTTTGTGA TTTGAACTCC TAGGRAAAAT ATGRTTTCRT AAATGTAAAA 
TGCACAGAAA TGCATGCAAT RCTTRTAAGA CTTARAAATT GTGTTTACAG ATGGTTTATT 
TGTGCATATT TTTACTACTG CTTTTCCTAR RTGCaTACTG TATATARTTC TGTGTATTTG 
ATAAATATTT CTTCCTRCAT TATATTTTTA GAATATTTCA GAAATATACA TTTATGTCTT 
TATATTGTAA TAAATATGTA CATATCTAGG TATATGCTTT CTCTCTGCTG TGAAATTATT 
TTTAGAATTA TAATTCaCGT CTTSTCATAT TTCRTCTGTA TACCTTCAAA TTCTCTGARA 
GTAAAAATAA AAGTTTTTAA ATATT 



I I I 1 1 I 

AsSPISKSRS LAAFLQQLRS PRQPPRLVTS TAYTSPQPRE VPVCPLTAGG ETQNAAM,PG ^ 
PTSWPLLASL LQILWKGGLK KQHDTLVEYH KKVGKIFRMK LGSPBSVHLG SPCLLEALYR 
TESVPQRLEI KPWKAYRDYR KEGYGLLILE GEDWQRVRSA FQKKLMKPGE VMKLDNKINE 
VLADFMGRID ELCDERGHVE DLYSELNKWS PESICLVLYE KEPGLLQKNR GDEAVNFIMA 
IKTMMSTFGR MMVTPVELHK SLNTKVWQGH TLAWDTIPKS VKACIDNRLE KYSQQPSADP 
LCDIYHQNRL SKKELYAAVT ELQLAAVETT ANSLMWILYN LSRNPQVQQK LLKEIQSVLP 
ENQRPREEDL RHMPYLKACL KESMRLTPGV PFTTRTLDKA TVLGEYALPK GTVLMLNTQV 
LGSSEDNFED SSQFRPERWL QEKEKINPFA HLPFGVGKEM CIGRRLAELQ LHLALCWIVR 
KYDIQATDNE PVEMLHSGTL VPSRELPIAF GQR 



Lggcgtctg CGGCAGCAGC GGAGGCCGAG AAGGGATCTC CAGTTGTGGT GGGCCTGCTA 
GTTGTQGGCA ATATCATTAT TCTGCTGTCA GGCCTGTCCC TGTTTGCTGA GACCATATGG 
OTGACAGCCG ACCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGAOSTC 
TTCGCTOGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCTTCATGGT AGCCAGTTTT 
GGTGTGGGTG CCGCACTCTG CCGCCX3CCQQ TCCRTGGTCC TCRCGTRCCT GGTGCTCATG 
CTCATCGTCT ACATCTTCX3A GTGCGCCTCX: TGCATCAOGT CCTACACCCA CCGTGACTAC 
ATGGTGTCCA ACCCATCCCT GATCACCAAG CAGATGCTGA CCTTCTACAG CGCGGACACC 
GACCAGGGCC AGGAGCTGAC CCGCCTCTGG GACCGCGTCA TGATTGAGCA AGAATGCTQT 
GGCACATCTG (JTCCCATGGA CTGGGTGAAC TTCACGTCAG CCTTCCGGGC GGCCACTCOG 
GAGGTGGTGT TCCCCTGGCC CCCACTGTGC TGTCGCCGGA OSGGAAACTT CATCCCCCTC 
AACQAGGAGG GCTGCCGCCT GGGGOiCRTG GRCTACCIGT TCACCAAGGG CTQCTTCGAA 
CACATOffiCC ACGCCATCGA CAQCTACaCG TGGGGTATCT CGTGGTTTGG GTTTGCCATC 
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CTGATGTGGA CGCTCCCGGT CATGCTGATA GCCATGTATT TCTACACCST GCTCTGAGGG 780 

ACAGGAGGGG AAGGCAACAT ACACACCCCG GACTCCTCCG CATCCTCCTC CTGCTTCCTC 840 

CGCTGGGCCT GGATGSCTGC CTCACCTCTC ACCTCCCAAC GTCCCTAGCC CTTACGTCCT 900 

TCCACTTCCA AGATCTTTTT CCAGGTTCCT GAGCCCTACT GTGTCTCAGG TGTGCCCTGR 960 

5 AACCCCA3GG CTTGTGTGCR CATATCCTTA GCCCATCTTT CAAGGGACCT CTCCATGATC 1020 

CCACCTCCCA TTCACMATR CCTCTCTTGT AGCTCTCTGA CCTCCTCCTT CRTQGCAGGC lOSO 

ATCGCCATTC TTGCTGAACC GTTTGTGATT GCCATTTGRG CTCTGGAAGC CTCTATTGCC 1140 

ATGAGAGTTC TGTCACGGTC ACTTTACTGT CCCCATCRTC ACCCAGCRCG GGGCTAAGCA 1200 
TATACTAGAT AGTCAATAAA TAR 



I I I I I I 

MASAAAAEAE KGSPWVGLL WGHIIILIiS GLSLFAETIW VTADQltRVYP LMGVSGKDDV 
PAGAWIAIPC GPSPFMVASP GVGAALCRRR SMVLTYLVLM LIVYIFECAS CITSYTHHDY 
MVSNPSHTK QMLTFYSADT DQGQELTRLW DRVMIEQECC GTSQPMDWVN FTSAPRAATP 
EWFPWPPLC CRRTGNFIPL NEEGCRLGHM DYLFTKGCFE HIGHAIDSYT WGISWPQPAI 
LMWTLPVMLI AMYPYTML 

Seq ID NOt 46 DNA Becpience 

Nucleic Acid Accession «: NM_006760.1 

Coding sequence: 39.. 593 

j Y T V T T 

GAAAGCCTGC CAGCACCTAT TCCACCTCCC AGCCCRGCAT GGCACCCCTG CTGCCCATCC 
GGACCTTGCX: CTTGATCCTG ATTCTGCTGG CTCTGCTGTC CCCRGGGGCT GCAGACTTCA 
ACATCTCAAG CCTCTCTGGT CTGCTGTCCC CGGCGCTAAC GGAGAGCCTG CTGGTTGCCT 
TGCCCCCCTG TCRCCTCACA GGAGGCAATG CXIACACTGAT GGTCX:GGAGA QCX»ATGACA 
GCAAAGTQ3I QACQTCCAGC TTTGTGGTGC CTCCGTGCCG TGGGCGCAGG GAACTGGTGA 
GTGTGGTGGA CAGTGGTGCT GGCTTCACAQ TCACTCGGCI CAGTGCaTRC CfiGGTGACAA 
ACCTOSTGCC AGGAACCAAA TTCTACATTT CCTACCTASI GAAGAAQQQG ACAGCCaCTG 
AGTCCAGCAG AGAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATGGAA TCCATTGGGC 
TGGGTATGGC CCGCACAGGG GGCATQQTQQ TCATCACGGT GCTGCTCTCT GTCGCCATGT 
TCCIGCTGGT GCTGGGCTTC ATCRTTGCCC TGGCRCTGGG CTCKCQCAAQ TAAGGRGGTC 
TGCCCGGAQC AQCAGCTTCT CCAGGAAGCC CAGGGCACCA TCCAGCTCCC CRGCCCACCT 
GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCTCG CCTCCTCCTC CTGCCCTCCT 
CTCCCCTAGA GCCCTCTCCT CCCTCTGTCC CTCTCCTTGC CCCCaGTGCC TCACCTTCCR 
ACRCTCCRTT ATTCCTCTCR CCCCACTCCT GTCAGAGTTG ACTTTCCTCC CATTTTACC» 
dTTAAACAC CCCCATAACSl ATTCCCCCAT CCTTCAGTGA ACIAAGTCCC TATAATAAAG 
GCIGAGGCTG CATCTGCCAA AAAAAAAAAA AA 

seq ID NO; 47 Protein Sequence 



I I I I I I 

MAPLLPIRTL PLILILLALL SPGAADFNIS SLSGLLSPAL TESLLVALPP CHLTGGNATL 
MVRRANDSKV VTSSFWPPC RGRRELVSW DSGAGFTVTR LSAYQVTOLV PGTKFYISYL 
VKKGTATESS REIPMSTLPR RNMESIQLQM ARTGGMWIT VLLSVAMFLL VLGFIIALRL 



55 seq ID N, 
Nucleic , 



ATTGCTGATG QATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 
ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 
TCCCrCGAST TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 
GCTGTTCAGT CCAQGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 
CACIGACGGC CACGGTGGCT TCCTGQOGAA PAATGATGGC ATCAAAACAA AAAAAGAACT 
CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCC-AAG.!\P. TATCAGCTGC TGCTTCAGGT 

gacx:tatrga qrttccaagg agaaaasaga tttgagaaat tttctga.^gc tcttgaagcc 

TCCATTATIA TGGTCACATQ OQCTAATTAG AATTATCAGA GCAAAGGGTA CCACAGACTG 

cmcagccig aatggagtcc tgcagtgtac ctgtgaagac agctacacct ggtttcctcc 

CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGRGCACTCC CAAGCTGTGR 
ATSrCATCrC ARCAACCTCX QCKAQAQTGT CAATTTCTGT GAGAGAACAA RGATTTGGGG 
CACTTTCAAA ATTAATGARA GGTTTRCAAA TGACCTTTTG AATTCATCTT CTGCTATATA 
CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAARA GCRTATGAAA GAATTCAAGG 
TTTTGAGTCG GTTCRQQTCA CCCAATTTCG AAATGGAAGC ATCGTTGCTG GGTATGAAGT 
TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCRQCCRTT GAACATGTTG CCGAQAAGGC 
TARGACaQCC CTTCACAAGC TGTTTCCATT AGAAGAOGGC TCTTTCRGAQ TGTTCGGAAA = = 
RGCCCRGTGT AATQACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 102 
CTGCAGCAGT GGCTACAGGG GRAACATCAC AQCCAAGTGT GAGTCCTCTG GGTGGCAGGT 108 
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G ACTTGTGTGC TCTCTCTGCT TGAAGAACIG AACAAGAATT TCAGTAT(3AT 1140 

■nii*n,ui.AiiT GCCACTGAGG CAGCTGTGTC ATCCTTCGTG CAAAATCTTT CTGTCATCAT 1200 

TCGGCAAAAC CCATCAACCA CAGTGGGGAA TCTGGCTTCG GTGGTGTCGA TTCTGAGCAA 1260 

TATTTCATCT CTGTCACIGG CCAGCCATTT CAGGGTGTCC AATTCAACAA TGGAGGATGT 1320 

CATCAGTATA GCTGACAATA TCCTTAATTC AGCCTCAGTA ACCAACTGGA CAGTCTTACT 1380 

GCGGGAAGAA AAGTATGCCA GCTCACGGTT ACTAGAGACA TTAGAAAACA TCAGCACTCT 1440 

GGTGCCTCCG ACAGCTCTTC CTCTGAATTT TTCTCGGAAA TTCATTGACTT GGAAAGGGAT 1500 

TCCAGTGAAC AAAAGCCAAC TCAAAAGGGG TTACAGCTAT CAGATTAAAA TGTGTCCCCA 1560 

AAATACATCT ATTCCCATCA GAGGCCGTGT GTTAATTGGG TCAGACCAAT TCCAGAGATC 1620 

CCTTCCAGAA ACTATTATCA GCATGGCCTC GTTGACTCTG GGGAACATTC TACCCGTTTC 1680 

CATAAATGAA GTTTTCCTAT TTTTTTCCAA GATAGAGTCA AACCTGAGCC AGCCTCATTG 1800 

TGTGTTTTGG GATTTCAGTC ATTTGCAGTG GAACGATGCA GGCTGCCACC TAGTGAATGA 1860 

AACTCAAGAC ATCGTGACGT GCCAATGTAC TCACTTGACC TCCTTCTCCA TATTGATGTC 1920 

RCCTTTTGTC CCCTCTACAA TCTTCCCCGI TGTAAAATGG ATCACCTATG TGGGACTGGG 1980 

TATCrCCRTT GGAAGTCTCA TTTTATGCCT GATCATOSAG GCTTTGTTTT GC-AAGCAGAT 2040 

TAAAAAAAGC CAAACKTCTC ACACRCGTCG TATTTGCATG GTGAACATAG CCCTGTCCCT 2100 

CTTGATTGCT GRTGTCTGGT TTATTGTTGG TGCCACAGTG GACACCACGG TGAACCCTTC 2150 

TGGRQTCTGC RCAGCIGCTG TGTTCTTTAC ACACTTCTTC TACCTCTCTT TGTTCITCTG 2220 

GATGCTC^TG CrTQQCATCX: TGCTGGCTTA CCGGATCATC CTCGTGTTCC ATCACATGGC 22 BO 

CCAGCATTTG ATGATGGCTG TTGGATTTTG CCTGGQTTAT GGGTGCCXITC TCATTATATC 2340 

TGTCATTACC ATTGCTGTCA CGCAACCTAG CAATACCTAC AAAAGGAAAG ATGTGTGTTG 2400 

GCTTAACTGG TCCARTQGAA GCAAACCACT CCTGGCTTTT GTTGTCCCTG CACTGGCTAT 2460 

TGTGGCTGTG AACTTCGTTG TOGIGCTGCT AGTTCTCACA AAGCTCTGGA GGCCGACTGT 2 520 

TGGGGAAAGA CTGAGTCGGG ATGACAAGGC CACCRTCATC CGCGTGGGGA AGAGCCTCCT 2580 

CATTCTGACC CCTCTQCTRG GGCICaCCTG GGGCTTTGGR ATAGGAACAA TAGTGGACAG 2640 

CCAGAATCTG GCTTGGCATG TTRTTTTTQC TTTACTCAAT GCATTCCAGG GATTTTTTAT 2700 

CTTATGCTTT GGAATACTCT TGGACAGTAA GCTGCGACAA CTTCTQTTCA ACAAGTTGTC 2760 

TGCCTTAAGT TCirGGAAGC AAACaGAAAA GCAAAACTCA TCAGATTTAT CTGCCAAACC 2820 

CAAATTCTCA AAGCCTTTCA ACCCACTQCA AAACAAAGGC CATTATGCAT TTTCTCATRC 2880 

TGGAGATTCC TCCGACAACA TCATGCTAAC TCAGTTTGTC TCAAATQAAT AAGGCAAGGA 2940 
ATCATAAAAT CAAGAAAfiAA TTTCCAGAAC AACTTGACRT TTAGAGACAR ATQTCAATGA 3000 
AGfiAAlTRTG CTCAGTATTC GATCHGQTTT TCTGATTTAG GGGTCTGGGA ATAAAACAAG 3060 



1 11 21 31 41 5^ 

I I I I I I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 
EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSaJ)PQ 
NCYLHTAGAL PSCECHLNHL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 
lEIQLKKAYE RIQGFESVQV TQFRNGSIVA GV^WGSf A ^E.LSAIEHV AEKAKTAM^ 



lEIQLKKAYE RIQGFESVgv Ty.^^^ai.vn ^x^v.^ooo^ .^„„.„--... 
LFPLEDGSFR VFGIWQCNDI VFGFGSKDDE YTLPLSSGYR GNITAKCESS GWQVIRETCV 
LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIS3NPST TVL,NLR.,WS ILSNISSLSL 
ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA ^^RLLETLEN "-^"PPTAL 
PLNFSRKFID WKGIETOKSQ LKRGYSYQIK MCPQ>TIS:PI EGKVLIGSDQ FQRSLPETII 
SMASLTLGNl LPVSKNGNAQ VNGPVISTVI QNYSIKEVFL FFSKIESNLS QPHCVPWDFS 
HLQWNDAGCH LVNETQDIVT CQCIKLTSFS ILMSPFVPST IFPWKIMTY VGLGISIGSL 
ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW flVGATVDTT 7"?!°^^^ 
VPPTHFFYLS LFFWMLMLGl LLAYRIILVF HHMAQHLMMA VGFCLGYGCP I'II_SV1TIAV 



VPPTHFFYLS LFFWMLMLGl LLAYRIiLVC Mm'i«ijniji-i..in .oc^xj^iv^v.. ' Z' ™ 

TQPSNTYKRK DVCMLNWSNG SKPLLAFWP ALAIVAVNFV WLLVLTKLW RPTVGEW;SR 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIPALLNAFQ GPFILCPGIL 

lSqllf nklsalsskk qtekqnssdl sakpkfskpf nplqnkghya pshtgdssdn 
imltqfvsne 



Seq ID NO; 50 DN 
Nucleic Acid Acc 
Coding sequence i 



CACGAATTTG TGGCGGAGAG GGCAAATAAC TGCGGCTCTC CCGGCGCCCC GATGCTCGCA 
CCATGTCGAG GCGCAAQCAG GCGAARCCCC AGCACATCAA CTCGGAGGAG GACCA3GGCG 
AGCAGCAGCC QCAGCAGCAG ACCCCGGAQT TTGCAGATGC GGCCCCAGCG ™CGCGG 
CGQGGGAGCT QGGTGCTCCR GTGAACCACC CAGGQAATGA CGAGGTGGCG AGTGAGGATG 
^GCCACAGT AAAQCGGCrr CGTCGGGAGG AGACGCACGT CTGTGAGAAA TGCTGTGCGG 
AGTTCTTCAG CATCTCTGAG TTCCTQGARC ATAAGAAAAA TTGCACTAAA AA.CCACCTS 
TCCTCATCAT GAATGACAGC GAGGGGCCIQ TQCCTICAGA AGACTTCTCC QGAGCTGTAC 
TCAGCCACcI GCcScCASr CCQGGCAGTA AGGACTGTC^ CAGGGAGAAT GGCGGCAGCI 
CAGAGGACAT GAAGGRGAAQ CCGGATCCGG AGTCTOTGGT GTACCTAAAG ACAGAGACAG 
CCCTGCCACC CACCCCCaG GACATAAQCT ATTTRGCCAA AGGCAAAGTG GCCAACACTA 
ATG?GACCTT GC^GCACTA CGGGGCACCA AGGTGGOGGT GRATCAGCGG AGCGCGGATG 
CACTCCCTGC CCCCGTGCCT GGIGCCAACA GCATCCCGTG GGTCCTCGAQ CAQ^TCTTGT 
GTCTGCAGCA GCAGCAGCTA CAGCAGATCC AGCTCACCSA GCAGATCCGC MCCAGGTGA 
ACATGTGGGC CTCCCACGCC CTCCACTCAA GCQQQQCAQQ QGCOSACACT CTGAAGACCT 
TGGGCAGCCA CATGTCTCAG CAGGTTTCTG CAGCTGTGGC TTTGCTC&QC CAGAAAGCTG 
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3 TCTGTCTCTG GATGCCTTGA AACAAGCCAA GCTACCTCAC G 

T CCCGAACGTC ATGTCCCGCC TCCCGAGCGC TTTGCTTCCT CAGGCCCCGG 1080 

a ctgtggcgct agacacatcc aagaaftggga 1140 
c caaagacgag gcggccctct 1200 

c tc-atagctcc 

acctccgctc ccacactgga gagagaccct t 
ccaccaaggg caacctcaaq g 

— :atcccc tatgcactct 
h ccccatagat gaaccgagtc tttctttaga cagcaaacct gtccttgtaa 

CCACCTCTGT AGGGCTACCT CAGAATCTTT CTTCX3GGGAC TAATCCCAAG GACCTCACGG 
C CTGCAGCCTG GGCCTTCTCC AGAARGTGRG GGTGGACCCA 

3 GGTGGGACCA AACTATAATT CCCCAAGGGC TGGTGGCTrC CAAGGGAGTG 

GGACCCCTGA GCCAGGGTCA GAGACCCTGA AATTGCAGCA GTTGGTGGAG AACATTGACA 
AGGCCACCAC TGATCCCAAC GAATGTCTCA TTTGOCACCG AGTCTTAAGC TGTCAGAGCT 
CCCTCAAGAT GCATTATCGC ACCCACACCG GGGAGRGACC GTTCCAGTGT AAGATCTGTG 
GCCGAGCCTT TTCTACCAAA GGTAACCTGA AGACACRCCT TGG3GTTCAC CGAACCAACA 
CATCCATTAA GACGCAGCAT TCGTGCCCCA TCTGCCAGA^ QAA3TTCACT AATGCCGTGA 
TGCTGCAGCA ACATATTCGG ATGCACATGG GCGGTCAGAT TCCCA^ 
AGAATCCCTG TGACTTTACG GSITCTGAGC C" " 

GCX5CTATCTG CCATGATGRT GTCATOSAAA GCATCGATGT AGAQGAAGTC AGCTCCCAGG 2160 

RGGCTCCCAG CMCTCCTCC AAGGTCCCCA CGCCTCTTCC CAGCATCCAC ICGGCATCAC 22 2 0 

CCACGCTRGG GTTTSCCATG ATGGCTTCCT TAGATGCCCC AGGGAAAGTG GGTCCTGCCC 2280 

CTTTTAACCT GCAGCGCCAG GGCAGCftGAG AAAACGGTTC CGTGGAGAGC GATGGCTTGA 2340 

CCAACGACTC ATCCTCGCTG ATGGGAGACC AGGAGTATCA GAGCCGAAGC CCAGATATCC 2400 

TGGAAACCAC ATCCTTCCAQ GCACTCTCCC CGGCCAATAG TCAAGCCGAA AGCATCAAGT 2460 

CAAAGTCTCC CGATGCTGGG AGCAAAGCAG AQAQCTCCGA GAACAGCCGC ACTGAGATGG 2520 

AAGGTCGGAG CAGTCTCCCT TCCACGTTTA TCCGAGCCCC GCCGACCTAT GTCAAGGTTG 25 80 

AAGTTCCTGG CACATTT3TG GGACCCTOSA CATTGTCOCC AGGGATGACC CCTTTGTTAG 2640 

CAGCCCAGCC ACGCCGACAG GCCAAGCAAC ATQGCTQCRC ACGGTGTGGG AAGAACTTCT 2700 

CGTCTGCTAG CGCTCTTCAG RTCCACGAGC GGACTCACAC TGGAGAGAAG CCTTTTGTGT 2760 

GCAACATTTG TGGQCQAGCT TTTACCACCA AAGGCAACIT AAAGGTTCAC TAC3iTGACAC 2820 

AOSGGGCGAA CAATAACTCA GCCCGCCGTG GASGQAAGTr GGCCATCGAG AACACCATGG 2 880 

CTCTGTTAGG TACGGACGGA AAAAGAGTCT CAGAAATCTT TCCCAAGQAA ATCCTQGCCC 2940 

A TGTGGACCCT GTTGTGTGGA AOCAGTACAC C3^CATGCTC AATGGCGGTC 3000 

A GACCAATGAG ATCTCTGTGA TCCAQAQTGG GGGGGTTCCT ACCCTCCCGG 3060 

TITCCTTGGG GGCCACCTCC GTTGTGAATA ACGCCACIGT CTCCMQATQ QATOQCTCCC 3120 

AGTCGGGTAT CAGTGCAQAT GTGGAAAAAC CAAGTGCTAC TGACGGCGTT CCX^AACACC 3180 

AGTTTCCTCA CTTCCTGGAA GAAAACAAGA TTQCMGTCAG CTAAGGGAGA ACTTGCGTGG 3240 

AAGGAGCAAT GCAGACACAG TGAAATCTCT AGAATCTGCT TTGTTTTOTA MAACTCATC 3300 

ACAACCTCAG 



1 1 I I I I 

MSRRKQAKPQ HINSEEDQGE QQPQQQTPEF ADAAPAAPAA GELGAPVHHP GNDEVASEDE 
ATVKRLRREE THVCEKCCAE FFSISEFLEH KKNCTKNPPV LIMNDSEGPV PSEDFSGAVL 
SHQPTSPGSK DCSRENGGSS EDMKEKPDAE SWYLKTETA LPPTPQDISf LAKGKVANTN 
VTLQALRGTK VAVNQRSADA LEAPVPGANS IPVIVLEQILC LQOQQLQQIQ LTEQIRIQVN 
MWASKALHSS GAGADTLKTL GSHMSQQVSA AVALLSQKAG 3QGLSLD7.LK QAKLPHANIP 
SATSSLSPGL APFTLKPDGT RVLPNVMSRL PSALLPQAPG SVLFQSPFST \A^DTSKKGK 
GKPPNISAVD VKPKDEAALY KHKCKYCSIW FGTDSSLQIK LRSHTGERPF VCSVCGHRF" 
TKGNLKVHFH RHPQVKANPQ LFAEFQDKVA AGNGIPYALS VPDPIDEPSL SLDSKP./L\ - 
TSVGLPQNLS SGTNPKDLTG GSLPGDLQPG PSPESEGGPT LPGVGPNYN3 PRAGGFQGSG 
TPEPGSETLK LQQLVENIDK ATTDPNECLI CHRVLSCQSS LKMHYRTHTG ERPFQCKICG 
RAFSTKGNLK THLGVHRTNT SIKTQHSCPI CQKKFTNAVM LQQHIRMHM3 GQIPNTPLPE 
NPCa)FTGSBE MTVGENGSTG AlCHDDVIES IDVEEVSSQE APSS3SIWPT PLPSIHSASP 
TLGFAMMASL DAPGKVGPAP FNLQRQGSRE NGSVESD3LT MDSS3LMGDQ EYQSRSPDIL 
ErrSFQALSP ANSQAESIKS KSPDRGSKAE S3ENSRTEME GRSSLPSTFI RAPP7YVKVE 
VPQTFVGPST LSPGMTPLLA AQPRRQAKQH GCTRCGKNFS 3ASALQ:HER THIGEKEFVC 
NICGRRFTTK QNLKVHYMTH QANNNSARRG RKLAIENTMA LLGTDGKRVS EIFPKEILAP 
SVNVDPWWN QYISMLNGGL AVKTNEISVI QSGGVPTLPV SLGATSWNN ATVSKMDGSQ 
SGISADVEKP SATDGVPKHQ FPHFLEENKI AVS 



1 I 1 I I I 

TCTAGCATTA AAGGAAQAAA CCXCGTCTTG CAACGAAGGC CTCAAAGAGG TCCAAATATC 
AACTTGCAGA CATAACAAGC AGftGTGTTTC TAAAOTGCTC TAAGAAAAGA AAGGTTAAAC 
TCTGTGAGTT GAAGGCACAC ATCACAAAGT AGTTTCTGAG AATGATTCTG TCTAGTTTTT 
fiTTTQAAGAT ATTTCCTTTT CTACT3TTGG C31TCAAATCG CTTGAAATCT CCACTTQCAA 
ATTCCACAAA AAGAGTGTTT CAAATCTQCT CTGTGTAAAG GGACGTTCCA CTCTGTGAGT 
TGAATACACA CAACACAAGG AAGTTACTGA GAATTCTTCT GTCTAGCATG ATATGAAGAA 
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ATCCCGTTTC CAACGAAGGC CTCftATGRGG TCTATATATC CACTTGCaOA CITTACAAAC 
AGAGTGTTTC CTAACTGCTC TATGAAAAGA AAGGTTAAAC TCTGTOGAGT TGAACGCACA 
CATCACAAAA GGAGTTTCTG AGAATCATTC TGTCTAGTTT TTTTAGGAAG A^" 
TTCIACCGTT GACTTCAAAG CGGCTGAAAT 



11 21 31 41 51 

I I I I I 

A AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 
CCTTCTTTTG CTGCIGGTGC TCCTGGAGCT GGGAGAGGCC CflAGGATCCC TTCACAGGGT 
GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AQCTCTCTGA 
:C GaGTCCTGCT CAATGGACCA 



GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TaCTTCGGCA CTATCTCCAT 
TGGCTCCCCA CCACAGAACT TCACTGTCRT CTTCGACACT GGCrCCTCCA ACCTCTGGGT 



G TAUTUlJAi-TA ui-j-j-aui-i-iki v.ji«u«>-^>-.^v- r^^..^^..^C AGCCTTCCCA 
GTCCAGCALA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 
GTCCGGGATC ATTGGAGCCG ACCAftGTCTC TGTGGAAGGA CTAACKGTGG TTGGCCAGCA 
GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 
TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCXaiGTAT TTGRCAACAT 
GATGGCTCAG AACCTGGTOG ACTTGCCGAT GTTTTCTGTC TACftTGAGCA GTAACCCAGA 
AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACQAC CACTCCCATT TCTCTGGGAG 
CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 
GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 
TTCCCTCATC ACTGGCCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 
CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 
CTTCACCATT AACGGACSICC CCTATACCCT CAGCCCAACT 3CCTACACCC TACTGGACTT 
CGT3GATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 
TGGGCCCCTC TGGATCCTGG GGGRTGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 
TGGGAATAAC CGTGTGGGAC TGGCCCCRGC AGTCCCCTAA GGACGGGCCT TGTGTCTGTG 
CCTGCCTGTC TQACAGRCCT TGAATATCTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 
GTTATTTTCC AGAGAATGTA GCTQTTTCCA GGGTTGCAAC TTGAATIMG ACCAAACAGA 
ACATGAGAAT ACACACRCAC ACRCACATAT ACACACACAC ACACTTCACA CATACACACC 
ACTCCCACCA CCGTCRTGAT GGAGGAATTA CGTTRTACRT TCA1ATTTTG TATTGATTTT 
■IGATTATGAA AATCASAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 
GCAGAGATCA TGGTATRATR AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 
CACACGGCCA GGCCTGTTTA TCTRCACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 
GTACCTGGAT CATTCIQAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 
TAACATCCTT AAATRTACAA TCGGAATTCA AQCATCTCCC ATTGTCCCAC AAATGTTTGG 
CTGTTTTTGT AGTTGGATTG TTTGTRTTAG GATTCAAGCA T^CCCATAT ATTGCATTTA 
TTTGAAATGT CTGTAAGTCI CTTTCCATCT ACRGAGTTTA GCACATTTGA ACGTTGCTGG 
TTGAAATCCC GAGCTGTCAT TTGACATQQT TCTCTGAACT TATCTTTCCT ATAAAATGGT 
AaTTAGRTCT GGAGGTCTGA TTTTGTGGCR AARATACITC CTAGGTCGTG CTGGGTACTT 
CTTGTTGCAT CCTGTCAGGR GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTrAAGA 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC MSCKTITT 



1 I I I I 1 

MKTLLLLLLV LLELGEAQGS LHRVPLRRHP SLKKKLRAKS QLSEFWKSHN LDMIQFTESC 
SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS MLWVPSWCT ^CRTHSRP 
QPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGQTFTOAB 
FDGILGLQYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGHIHSH 
FSGSLNWVPV TKQAYWQIAL DNIQVGGTVM FCSEGCQAIV DTCJrSLITGP SDKI^QNR 
IGAAPVDGEY AVECANLNVM PDVTFTINGV PYTLSPTAYT LLDFVIXSWF CSSQFQGLDI 
HPPRGPLWIIj GDVFIRQFYS vpdrgnnrvg lapavp 



Seq ID NO: 



1 11 21 31 41 51 

TACGCQCTQC GGGACCGGCA GGGGAACGCC ATCGGGGTCA CA.GCCIQCGA CATCGAOSGG 
TAt-bl-Ui-iuu ^^^^-Tf^iin inr-oiTaoTr: rrTTCTCGGG GGTGGCCACG 



TACGCQCTQC GGGACCGGCA GUUU««i-uu- AiL.uuuui.i-n v.^.3v,v...v>v.-.. 

GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 
taSSEgaca RGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 
GTCAACQTQG CCCGIGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 
^AAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC . 
cStgatgccc TCATTGAAAT GGACCCTGAG GCCACTGACC TCTCCCGGGG CATTCTGGCG 
CTCAGAQATG TGGCTGCTGA GGCTGGGGTC AGCAAi^TATA CAGGGGGCQG AGGCGTCAGC 
G?S^cS TCOrCAGCAG CAGTGCCTOS GRTAtaTCT GCGRCAAI^ ^^^^ 

Iacitcctit tccrcaaccg qqqqgatggc acctttgtgg acgctgcggc cagtgctggt 

SSIcC ^^GCA TGGGCGAGGT GTCGOCCTGG C^CTTCAA C^TGATGGC 
^OTGGACA TGQTCTATGG CAACTGGAAT GGCCQCCACC GCCTCTATCT QCAAATGAGC 
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ACCXaVTGGGA AGGTCCGCTT COSGQACaTC GCCTCaCCCA AGTTCTCCAT G 

GTCCGCACGG TCATCACCGC CGRCTTTGAC AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

AAC»TTGCCT ACCGCAGCIC CTCAGCCAAC CGCCTCTTCC GCGTCATCCG TAGRGAGCRC 840 

GGAGACCCCC TCATCGAGGA GCTCAATCCX: GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 

ACAGGGGGTG TGGTGACCGA CTTCGACGGA GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 

GGAGAGTCCA TGGCTCAGCC GCTGTCCJGTC TTCCGGGGCA ATCAGGGCTT CAACAACAAC 1020 

TGGCTGCGAG TGGTGCCACG CACCCGGGTT GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 1080 

CTCTACACCA AGAAGAGTGG GGCCCACCTG RGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 

TGTGAGATGG AGCCCGTGGC ACACTTTGGC CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 1200 

GTGACGTGGC CAGATGGCAA GATGGTGAGC CGGARCGTGG CCAGCGGGGR GATGAACTCA 12 60 

GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGQACCCAGC CCCaCTGGAG 1320 

ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1380 

ACACCTATGG AAQCTACAGG TGCCGGACCA ACAAGAAGTG CSGTCGGGGC TAOGAGCCCA 1440 

ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCX: CGCCCCACCA 1500 

CCCCCACCGC TGCTGCTGCC RCTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620 

CCAGCTGCTG AGCAGGSGTG GGACATGAAC CAGCGGATGG BGTCCAGCAG GGGAGTGGGA 1680 

AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC 1800 

CTGTGCTGGG COiCATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 

ATTCCAGTGG GTCTA^TGAC CATATCTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CX3CATCTGCA 2040 

AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATQTAA 2100 

GACACTTGGC ACAAAACCTG GCACATAGIA AAGGCTCAAT AAAARCAAGT GCCTCTCACT 2160 
GGGCTTTGTC A 



n #! NF 060528.1 



1 I 1 1 1 1 

MDPBASDLSR GILALRDVAA EfiGVSKYTGG RGVSVGPILS SSASDIFCDN ENQPNFLFHN 
RGDGTFVDAA ASAQVDDPHQ HGRGVALADP NRDGKVDIVY GNWNGPHELY LQMSTHGKVR 
FRDIASPKPS MPSPVRTVIT ADFDHDQELB IFPNNIAYRS SSSNRLFRVI EEEHGDPLIE 
ELNPGDALBP EGRSIGGWT DFDGDGMLDL ILSHGBSMSQ PLSVFRGNQG FmnWlRVVP 
RTRVGAFARG AKWLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPEG 
KMVSRHVASG EMNSVLEILY PRDEDTLQDP APLBTPMNAS SSHSCALETS PYVSTPMEAT 
GASPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS 



Saq ID NO: 57 DNA sequence 

Nucleic Acid Accession #! AJ279016.1 

Coding sequence: 1..1962 



11 21 31 11 51 

I I I I I 

lA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATC3iC TGAGGGOTCC 
CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 
AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACT6 ATQTGGACCA TGATGGGGAC 
TTTGAGATCG TCGTGQCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGQGCC 
CAGAAGCGGC TGGTGAACAT CGCGGTCGRT GAGCGCAGCT CACCCTACTA CGaSCTGCGG 
GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 
GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CaCCQACAAG 
TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 
CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGQCCT GTGTGGACAG MAGGGCTCT 
GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC MATSCCCTC 
ATTGAAATGG ACCCTGAGGC CAGTGACCIC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 
HCTGCTGAGG CTGGGGTCAG CRAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 
CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGQCCTAA CTTCCTTTTC 
ScCGGG GCGATGG^C CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 
CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA ^^rOGACATC 
GOT3GCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 
GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGQTC 
ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 
CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTWTCCGTA GAGAGCACGQ AGACCCCCTC 
ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 
GIGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATC- T^-TC-CATi^- AGAGTCCATG 
rScMCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCIGCGAGTG 
GKCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCl CTACACCAAG 
AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 
CCCGTGGCAC ACTTTGGCCT GGGGRAGGAT GAAGCCAGCA QTGTGGAGGT GACGTGGCCA 
Stggcaaga TGGTGAGCCG GAACGTGGCC AGCGGGGAGA TGAACTCAGT gciggagatc 
CTCTACCCCC GGGATGAGGA CACACTTCA3 GACCCAGCCC CACTGGAGTG TGGCCAAGGA 
TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATOCATCCA GITCCCATTC 
^TGCCCTC GAQACAAGCC CGTATGTGTC AACACCIATG GAAGCTACAG GTGCCGGACC 
i^GAAGT GCAGTCGGGG CTAOSAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 
CTCQGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTQCTGCrGC ^Ci:<^<^^<^ 
GCTGCTGCCG CrGCTGGAGC TGCXS^CTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 

248 



wo 03/003906 



PCT/LS02/21338 



S TGGTTAAC3GA GAGCTGCGAG CCX»GCTQCT QRQCAQGGGT GGGACATGAA 1980 

CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGAO^T 2040 

AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCXICAAGCCC ATCXATGCAC ATTACTTAGC 2100 

TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CRTTCCAGTG GGTCTAATGA CCATATCTIA 2220 

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACRGG AAGTATGAGG ACTTTAGTGT 2280 

CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCITACA CCAGTCACTT 2340 

AACTTGTTAO CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

TTAGTGTGGA GATTAGATTA AATGTATSTA AGACACTTGG CACAAAACCT GGCACATAGT 2460 
A TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 



1 11 21 31 41 51 

I I I I I I 

MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLPPDVD SNPTQLNYGV AVTDVDHDGD 
FEIWAGYNG PNLVLKYDRA QKRLVNIAVD ERSSPYYALR DRQGNAIGVT ACDIDGDGRE 
EIYFLNTNNA FSGVATYTDK LFKFRNNRWE DILSDEVNVA RGVASLFAGR SVACVDRKGS 
GRYSIYIANY AYGNVGPDRL lEMDPEASDL SRGILALRDV AAEAGVSKYT GGRGVSVGPI 
LSSSASDIFC DNENGPNFLF HNRGDGTFVD AAASAGVDDP HQHGRGVALA DFNRDGKVDI 

™ T r-.^,r.m,r,v 170T;i3nT I ODV FCMDqPVBTV TTADFDNDOE LEIFFNNIAY 



VYQNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV ITADFDNDQE LEIFFNNIAY 
RSSSANRLFR VIRREHGDPL lEELNPGDAL EPEQRGTGGV VTDFDGDGML DLILSHGESM 
AQPLSVFRGN QGFNNNWLRV VPRTRFGAFA RG.AKWUYTK KSQAHLRIID GGSGYLCEME 
PVAHFGLGKD EASSVEVTWP DGKMVSEITVA SGEMNSVLEI LYPRDEDTLQ DPAPLEOGQG 
FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT NKKCSRGYEP NEDGTACTGT 
LGQSPGPEPT TPTAAAATAA AAAAAGAATA APVLVDGDLN LGSWKBSCE PSC 



Seq I 



1 11 21 

I I I I 



ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCIGGTT GGATGGGACT GGGTGGGCCC £ 
AG03GCTCCT CCCCAGCATC CCCICCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 
GTTCrGASGT ATGACCGGGC CCAGAAGCGG CTGGTGRACA TCGCGGTCGA TGAGCGCAGC 3 
TCftCCCTACT ACGCGCTGOG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC < 
ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC ; 
40 CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCC»CAGAA ACAGQCCTGT GCTGAAGCCT . 
CCACCTACAA CCCCTQCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG QGACTTTTCC 
TOT™ GTCAGGCTTC TCCGGACA6C AGGCAGGGAG AG^^^GCC G^TTCCCTGC • 
TGTCGGGGTG GACTGAGACC TACCCATGAA CCAGAACCAT TTCITCTGAG ACCCAAATCA 
GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GOA^GACATC < 
45 CTGAGCGATG AGGTCAACGT GGCCCaTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG 
GCCTGTOTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATT6CCAA TTACGCCTAC 
GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCXM AGGCCAGTOA CCTCTCCCGG 
rrrATTCTSG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 
ScTCcScA CTGCCTCTCC A..GCATTGGT GAGATATCIG GCAGAACCGA GGA^CGGGAA 
50 ggaggaScc CAGAGGAGGC AGATGAGGAO CACAGTGGGG mQOA^cac cagccaactg 
TGCCGGCTGG GCTGGAAGGA CGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GOTGQAGGAA 

cagagggagg ctqgggcagc tggcgtgccc agaggacgtg ttcoaacagc tctgcagact 

T^AAAAGCC ATTTGGCTGA CAAGAACCTA TrPGGCCCAC CATGTTACTA TTCTGTCTGC 
GC^CTC CAGCCCACCC TTTCCCTGCC CGCCAAOCCC CCCAACACTA CCCTCTAGCC 

55 cccStgtca ctcagctaat gacacatgga cgtctggctg gaaaactagc ccggagtotc 
CCCCACCCCC gagccccagg aatggacccc aaatgtaagg gccgccatgc TGftGCC^GC 
CTGATGGCTG AGGCTTTGGG CGCGTGGCCA GCGCTCAGCA CCACTQTGOT GES^^GGGGGC 

ctgagaagct gggaggaaag caggcagaag gggcaggcca tgtccagatg tgcactcagg 

GMCTGGGAG GTCCCTGGAG CCAf.GCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC 
60 CTGGGAGAAC CTCCCATTTT ACAAAGAP.CA GACGGAGATC CAGGGAGGAG AAOQGACTCG 
CTCAAGGTCA CACAGGAGT3 CCATCTAGTG GCCACCA-EX3C CAGCTCTCGG 33°^=!=°^ 
rrrCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 

ctctccStc ccctgg^Sc caacttcccc agctgcttga ggcctcttga agccgggaca 

GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGA^.CTGGG TTCTGGACAT SGCCAAGGCC 
65 CKGCGTOGA ACCAGATGGA AAAAGAGGAG GGGA^.GATTC ATGGAGACCA TGAGCCCAGA 
TTTAGGCTCA OGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 
CTG^GTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT SaSGCTTGCT 
TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GRGGCGTCAG CGTGGGCCLC 
Itcctcagca GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC taacttcctt 
70 nCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 

SotStcg ttcacctcaa atatcacctc tgcagagatt ttcctcactc cctgtgccac 

CTMCAGAAA CTGGTCCTTC CTCCTCCTGC TGCCCGIGGC ATGCACGTCT TCTTCAGGCT 
rr-lfRTTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 
TTCTTGACGC AAQGCTTCGC CTCCAGTGCC CACCXMAGGA CACTCAGCCT CCAGGGTTCT 
75 CRGGGGGCCC CACCCTQCCT TCTGQCAAGA GCTCCCIGTG TCCTGGGGTC TCTGATCCCC 

Stkctatt acattgtcct gtggtctgcc atcccagaga qcctgatgac ccacagctat 

TTGTCCTCTG AAAGAGTCAA CGTGGGTCJTG GACGACCCCC ACCAGCATGQ GCGAGGTGTC 
GCCCTGGCTG ACTTCAACCG TQATGGCAAA GTGGACATOS TCTATGGCAA CTGGAATGGC 

249 



wo U3/003906 



PCT/US02/21338 



A AATGfiGCACC CATGGGAAGG TCCGCTTCCG GGACaTCGCC 2700 

TCACCCAAGT TCTCCATGCC CTCXCCTGTC CGCACGGTCA TCRCCGCCGA CTTTGACAAT 2760 

GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCSU^CCGC 2820 

T GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGQAAC 2880 



A CAGSTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTQCaGGQ 3060 
AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGa3CCACA AGATACSiAAG i 3120 

CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGSG 3180 

GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 

CGGGGTCCAA TCACTACCAG GAAAAG3GGC TACGGGGTCC AATCACTACC AGGRRAAGGG 3300 

GCTACQGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 

AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAG 3420 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 3480 

CCAGGAAAAS GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 

ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3 600 

AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3 6 SO 

GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3 780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG GTCCAATCAC TACCAGGAAA 3 340 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3 900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3 960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TOAGGGCITC 4080 

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTQC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT ffTGAGATGQA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 4320 

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GQACCCAGCC 4380 

CCACTGGAGT SIGGCCAAGQ ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGIGCCCT OGAGACAAGC CCGTATGTGT CAACACCTAT 4600 

GGAAGCTACA GGTGCCGGAC CAACAAQAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4660 

GGCACAGCCT GCGTGGGTAC TQAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 



51 

r 



MACPGGLPAR CSGWMGLGGP SGSSPASPPH SSSRYNGPNL VLKYDRAQKK LVNIAVDERS 60 

SPYYALRDRQ GNAIGVTACD IDGDGREEIY FLNTNNAFSG HSSSAQVPSG LHENRPVLKP 120 

PPTTPAGLLQ LPPLSGRDFS SSLGQASPDS RQGERVPVPC CRGGLRPTHE PEPFLLRPKS 180 

GVATYTDKLF KPRNNRWEDI LSDEVNVARG VASLPAGRSV ACVDRKGSGR YSIYIANYAY 240 

QNVGPDALIE lOPEASDLSR GILALRDVAA EAGVSKYTEG FSHTASPSIG EISGRTEERE 300 

GGDPEEADEB HSGDGSTSQL CRLGWKDGOP KEEAAALVEE QREAGAAGVP RGRVRTALQT 360 

SKSHLRDKNL FGPPCYYSVC APSPAHPFPA RQAPQHYPVA PLVTQLMTHG RLAGKLARSV 420 

PHPRAPGMDP KCKGRKAEPG LMAEALGAWP ALSTTWPGG LRSWEESRQK GQAMSRCALR 480 

ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRRDS PWTQECHLV ATMPALQGLE 540 

GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLEAGT VPGAAI.PGNP GNWVLDMMCA 600 

LAWNQMEKEE GKIHGDHEPR FRLRKAREAE FPPGSSEEPL LQFPSGLRGS PVLQVGLGLA 660 

SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGPNFL FHNRGDGTFV DAAASAE^b 720 

AFIVHLKYHL CRDFPHSLCH LABTGPSSSC CPKHARLLQA PHCHHGLSMS FTRTGSRFYS 7B0 

FL^QGLASSA HRRt'sLQGS QGAPPCLLAR APCVLGSLIP TAYYIVLWSA "ESL^SY 84 

LSSERVNVGV DDPHQHGRGV ALADFNRDGK VDIVYGNWNG PHRLYLQMST HGKVRPRDIA 900 

SPKFSMPSPV RTVITADFDN DQBLEIFFNN lAYRSSSANR LFRCSILARG SSSLTAGGRN 960 

GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AKEPASAIAG 1020 

KGKGNVAOSV PRTQAPQDTK PHYHKKGLQG PITTRKRGYG VQSLPGKGAT GSNHYQEKGL 1080 

RGPITTRKRG YGVQSLPGKG ATGSNHYQEK GLQGPITTRK RGYGLQSLPG KGATGSNHYH 1140 

RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI TTRKRGYGLQ SLPGKGATGS 1200 

NHYQBKQLQG PITTRKRGYR VQSLPQKGAT GSNHYQEKGL RGPITTRKRG YGLQSLPGKE 1260 

AMGSHHYQEK GLRAPITTRK RGYGVQSLPQ KGATGStr/IR REHGDPLIEE LNPGDALEPE 1320 

GRGTGGWTD PDGDGMLDLI LSHGESMAQP LSVFRGNQGF NNOTILRWPR TRFGAFARGA 1380 

KWLYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS SVEVTWPDGK MVSRNVASGE 1440 

MNSVLEILYP EDBDTLQDPA PLECX3QGPSQ QENGHCI-IDTN ECIQFPFVC? RDKPVCVKTY 1500 

GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR PKKELQLSQG ICTPVWSFFL 1560 
PGCRLLLKRA QLQAAPSTLL QKRPGIPEAQ VYEQDQE 



Seq I 



M 000584.] 



I I I I I 

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCRGGAAGAA ACCACCGGAA GGAACCATCT 

250 



wo 03/003906 



PCT/US02/21338 



T AAACATGACT TCCAAGCTGG COSTGGCTCT CTTGGCSfflCC TTCCTGATTT 

CTGCAQCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGT3C TAAAGAACTT AGATGTCAGT 
GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 
AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 
TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 
CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 
GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 
A TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAOTTTTT 
h TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 
^TAATTT TTAAATATAA GGATTTTCCT AGATATTGCA CGGGAGAATA 
TACAAATAGC AAAATTGAGC CAAQGGCCAA GAGAATATCC GAACTTTAAT TTCAGGAATT 
iGAATGT GATATTTGAA GCATCACATA AAAATGATGG GACAATAAAT 
h AGTCAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC 

C TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTCTCCTTTA 

TTTCTAAGTG GAAAAAGTAT TAGCCAGGAT CTTACCTCAC AGTGATGTTS TGAGGACATG 
TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT TTCAAGTGTA ACTTATTAAC 
CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT STGC.AAGA^T TTCGAA-AAAT 
AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGT.iiAATTTR TTTTATTTTA 
GATATTARAT GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 
ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 
AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 
AGTCTTGTCA TTGCCAGCTQ TUrTGQTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTTAG 
AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT 
TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA ATGACTGCAT TTTTAAATAC 
AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATCTGCT CTCCAAATTT TTTTTACTGT 
TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCR 
AAGTAAAAAA AAAAAAAAA 



LsKLAVAbL AAFLISAALC EGAVLPRSAK ELROJCIKTY SKPFHPKFIK BLRVIESQPH 60 
CANTEIIVKL SDGRELCLDP KBNWVQRWE KFLKRAENS 

Seq ID NO; 63 DNA sequence 

Nucleic Acid Accession ft; BOS sequence 

Coding sequence: 52-1146 

11 21 31 41 51 

1 1 I I 1 1 

QTTACCCAGC ATTGTGAGTG ACAGAGCCTG GATCTGAACG CTQATCCCAT AATGCATCCT 60 

CAAQTGGTCA TCTTAAGCCT CATCCTACAT CTGGCAGATT CTGTAGCTGG TTCTGTMAQ 120 

GTTGGTGGAG AGGCAGGTCC ATCTGTCACA CTIACCCIGCC ACTACAGTGG ASCTGTCACA 180 

TCCATGTGCT GGAATAGAGG CTCATGTTCT CTATTCACAT GCCAAAATGG CATTGTCTGG 240 

ACCAATSGAA CCCACGTCAC CTATCGGAAG GACACACGCT ATAAGCTATT GQQGQACCTT 300 

TCAAGAAGQQ ATGTCTCTTT GACCATAGAA AATACAGCTG TGTCTGACAG TGGCGTATAT 360 

TGTTGCCGTG TTGAGCACCG TGGGTGGTTC AATGACATGA AAATCACCGT ATCATTGGAG 420 

ATTGTQCCAC CCAAGGTCAC GACTACTCCA ATTGTCACAA CTGTTCCAAC =5™^=! "0 

GTTCGAACGA GCACCACTGT TCCAACGACA ACGACTGTTC C^TGACGAC TGTTCCAACG 540 

ACAACTGTTC CAACAACAAT GAGCATTCCA ACGACAACGA CTGTTCTGAC GACAATOACT 600 

GTTTCAACGA CAACGAGCGT TCCAACGACA ACGAGCATTC CAACAACAAC AAGTGTTCCA 660 

GTGACAACAA CTGTCTCTAC CTTTGTTCCT CCAATGCCTT TGCCCAGGCA SAACCATQAA 720 

CCAGTAGCCA CTTCACCATC TTCACCTCAG CCAGCAGAAA CCCACCCTAC GACACTGCAG 780 

GQAQCAATAA GGAGAGAACC CACCAGCTCA CCArlGTACT CTTACACAAC AGATGGGAAT 840 

GACACCGTQA CAQAQTCTTC AGATGGCCTT TGGAATAACA ATCAAACTCA ACTQTTCCTA 900 

Stagtc TACTGAKGC CAATACCACT AAAGGAATCT ATGCTGGAGT CTGTATTTCT 9 

GTCTTGGTGC TTCTTGCTCT TTTGGGTGTC ATCATTGCCA AAAAGTATTT CTTCAAAAAG 1020 

gaggttcaac aactaagtgt ttcatttagc agccttca;>a ttaaagcttt scaaaatgca 



GAGGTTCAAC AACTAAGTGT TTCATTTAGC AGCCTTCAAA o^^.^^^ 

gttgaaaagg aagtccaagc aqaaqacaat atctacattg rgaatagtct ttatgccacg 

GACTAAGACC CAGTGCTGCT CTTTGAGAOT TTACGCCCAT 3AGTGCAaA.R GACIGAACAG 



GACTAAGACC CAGTGGTGCT i;tttu«uhui L^r^^^^--^.^ ' I"" 

ACATCAQCAC ATCAGACGTC TTTTAGAQX CAAGACAATT TTTCTGTTTC AGTTTCATCT 1260 

GGCATTCCaA CATGTCAGTS ATACTQGGTA GAGTAACTCT CTCACTCCAA ACTGTGTATA 1320 

GTCAACCTCA TCATTAATGT AGTCCTAATT TTTTATGCTA AAACTCGCTC AATCCTTCTG 13 80 

ATCATTGGAG TTTTCTCTCA AATATGAACA CTTTATAATT GTATGTTCTT TTTAGACCCC 1440 

ATAAATCCTG TATACATCAA AGAGAATAGT CC<^^<^^ ^TCcScG^ 1560 

TTSGCCATCA CAGCTGTCCA GAAGAGGGGA ATCTGTCTTA AAAACCAG^A A.RTCCAACGT IbbU 

GAGACTTCAT TTGGAAGCAT TGTATGATTA TCTCTTGTTT CTATGTTATA CTTCCAAATG 1620 
TTGCATTTCC TATQTTTTCC AAAGGTTTCA AATCGTGGGT TTTTATTTCC TCCGTGC-GGA 

AACAAAGTGA GTCTAACTCA CAQQTTTAQC TGTTTTCTCA TAACTCTGGA AATGTGAT^C. 1/^u 

ATTAAGTACT GGATCTCTGA ATTGGGGTAG CTQTTTTACC AGTTAAAGAG CCTACAAiAt, IBUU 

TATGGAACAC ATAGACACCA GGGGAAGAAA ATCATTTGCC AGGTGATTTA ACATATTTAT 1860 

GCAATTTTTT TTTTTTTTTT TGAGATGGAG CTTTGCTCTT GTTGCCCAGG CTGGAGTGCG 1920 

ATGGTGAAAT CTCGGCTCAC TGTAACCTCC ACCTTCCGGG TTCAAGCAAT TCTCCCGCCT 1980 

CAGCCTCACG AGTAGCTGGG ATTACAGGTG TCCGCCACTA TGCTCAGCTA MTTTTTGTA 2040 

WTTTTTAGT AGAGACAGGG TTTCATCATG TTGGCCAGAA TQATCTCGAA CTCCTGACCT 2100 

CAGAGGATCC AGCCTCCTTG QCCTTCCAAA GTGCTGGGAT TACAGGCATG AGCCACCQTQ 2160 
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CCTGQTCCAT TTATQCAATT TTTATTACAC GTGTGTCATG GTGCTAAGCA TTTAflACAAC 2220 

CATCAGGCTT TCTGAACAAA TTTTTAGAAC CAAGAAGATC AGCAATAAAA GAAAAAAAAA 2280 

AGGOSGGAGT AGGGAAAAGT CTGCCTTTTC TGCCTTATAA GTTACATTGC AGATATAAGT 2340 

GTTTGAAACT CACTAGTTTT GACCCAAATT CTAAATCGCC TCCAAACCAC TTTCATGTTT 2400 

CAAGGTGGGC AGGTAGCTTG AGCCCAGAAG TTTGAGACCA GCCTGGGCA=i CATGGCAATA 2520 

T ATTAAGAAAA AAATTTAAGG AAAAAAAAAA AAGAWJiAAT TGTTTTGGAA 2560 

?TCTCAT ATCTAATCAA TTCCTGTATT tTTTTTTTTT TTTTTTTTAC 2640 

___JTATTGT AAGGATATCT TTAGTCAGTA TAAAAATATA TAAAAAACAA 2700 
A CCAGTATAAT TGTTTAGAAG TATATATTTG CCGGCCGGGC 
A TCCCAGGGCT TTGGAAGGCT GAGGTGAGTG GATCACTAGA 

3 CCTGACCAAC ATGGAGAAAC ACTGTTCTCT ACTAAGTATA 

GGCTAGATTT GTCAGAAAAG AAACATTTCT ACATGAfiCAG TTAAGGCTGG 
CAATTTCAGT TTGAGTAAAC TCTGAAATAA TTCATGGTTG AGGITCAGAG 

CATAGTTGTC TGGGGCAACA GGATGCTTGC TTTCAGAGGA AACrrTTTGT ^tTTGTTTTT 3 060 

gijijITTTTGA GATGGAGTTT TGCTCTTGTC ACCCAGGCTG AAATGCAATG GCACAATCTT 3120 

GGCTCaCTGC AACCTCCACC TCCCAGGTTC AAGTGATTCT CCTGCCTCAG CCTCCTGAGT 3180 

AGCTGGGATT ACAGATGCCC GCCACAACGC CCAGCTTATT TTTAGTAGAG ACAGGGTTTC 3 240 

ACCATATTGG CCaGGCTGGC CTCCAACTCC TGACCTCAGG TGAGCCACCC GCTTTGGCCT 3300 

CCCCAAAGTG TTGOQATTAC AGGCXTAAGC CACCGCGCCC AGCCAGAGGC AACATTTTTT 3 3 60 

AACGCAGTTA TCATTCTAGG AAATTTATAG GTCCTTTG.^A GGA^AATTCT GTGGGCAAAT 3420 

AAGATTGTGA TACATGGTAT TTCAGTTTTC CCAAhlGrGG CCAGCCCGAT CTGGTCAAAA 3480 

ATTTrATTTT TTAAAAGCTA TAGTGTCTTT IT^rCTTAAP. rr I'GAGGCAA CATGCACAAA 3540 

ATTGGAGATT TGflflATTAAA GCCRAGATTT QTAGTTTCTC TGGAAAQACC TGGCAAGATT 3600 

GGACTGGATT GCTATGTGAC CAGGGTCCCA CTAGATGGGG CTGCATCCTC TAATCCCCAA 3 660 

ATCCTTATQT TCCCTGCATG CTCACCTTTG TTACCTGCCT GACACCTGTG GGGCTTTTAA 3 720 

CTTTATGGCA RCTGCCCTAT TCTCTQGATC CTTCCTGAGG ATTTATGATG CGTAATACTC 3780 

CAGGAATCTG STTAGCTTTG CTTAACACAT TTCCAAAACT TGTTTGRATG CATGAGTACA 3840 

GTCACTAGTA GCATTCTGTG CAGTACAATG TATGGGGGCT TRGGASITrjf GGGTAGTATA 3900 

CAGGATTAGG GATAGGACTT QAGTCTAATC CTAACTCTTA GCAGTTACAC TGGATGACAT 3960 

TAGAGCAAAT GGTTCTTTAC GTCTACRTTT TCTTCATCTQ TAGATSTAAT AATTTCCATA 4020 

TCAACTATGA TGTACAGTSC TAATTCX:AAT GAAATGTTAC ATGTGAGAAQ TCTTTGAAAT 4080 

GTAAAAAACA CTACAQATAC TGAAGCAGTT TGGAGAATTA AAAAACACTA CGAAAACACA 4140 

GCTTGGTAAA TAATTTTTTT TTTCTTTTGA QACAQASrCT GGCTCTGTCA TCCAGGCTAG 4200 
AQTGTAGTG 

Probein Accession ft: NP_036338.1 

1 11 21 31 41 51 

iLiPQWILSI. ILHLADSVAG SVKVGGEAGP SVTLPCHYSG AVTSMCWNRG SCSLFTCQNG SO 

IVWTNGTHVT YRKDTRYKLL GDLSRRDVSL TIENTAVSDS GVYCCRVEHR GWFNDMKITV 120 

SLEIVPPKVT TTPIVTTVPT VTTVRTSTTV PTTTTVPTTT VPnHSIPTT TTVPTTMTVS 180 

TTTSVPTTTS IPTTTSVPVT TTVSTFVPPM PLPRQNHEPV ATSPSSPQPA ETHPTTLQGA 240 

IRREPTSSPL YSYTTDQNDT VTBSSDGLWN NNQTQLFLEH SLLTANTTKG lYAGVCISVL 300 
VLLALLGVll AKKYFFKKEV QQLSVSFSSL QIKALQNAVE KEVQAEDNIY lENSLYATD 



1 1 I I 1 ' 

ftGAAGAAAGC TGQTAGGGGC TGGGAGAGGG TACCACAGGG GAGTATGATC T 



ftGAAGAAAGC TGQTAGGGGC l^^^A^.Hi-^^^ iH^^n^^^^v. o„v,x„.^...- 

CCAGAGAAGG TTCCCTGAGG AAATAGTACC TGAACTTAGA CTTGAAGGAT M«G^TGTT 120 

AArTOGGAGG AGAGAATGTT CCAGGCAGAG GAAAAGGCAT ATGCAAAAGT CCAGCGCC-TT J.OU 

Sa^CAC AGCTGGGGTG CCTGGAGTGA GATGGAGCTG GAAAGATCCA GGTGGAGCTT 240 
G^^^ ACCAGCCCCC AGAGGGCGCT ACTGGAGCCT GGTCAGAGGA 3GACTTTGGG 

GCACCCATCC AGGACAGCTT CGAGTTAATC ATCCCCAATG TGGGCTTCCA GGAAC . JOU 

CCrOOGGAGG CCCXGCTGGA QCGGCGGGCA GTCGCTCTAG GCCGCAGGCT AGCTCGAAGT 420 

CTAGGCCAGC AGGACGATGA AGAGAATGAG CTGGAGCTTG AGCTGGAGCA SS^==^GGAT 480 

GATGAGCCTG ACGTGGAACT TTCTGGTGTT ATGTTCAGCC CCCTCAAGAG CCGCGC JiGG 540 

nrfrTGGCCC ATGGGGATCC CTTCCAGGTG TCCAGAGCTC AAGACTTCCA GGTGGGAGTC 6 00 

aS^^ LqCCCAGAA ACTGGTGGGA GTCAACATTA ACCCCTATGT SGCCGTGCAA 660 

OTGGGGGGGC AGQGCCGTQT GACOSCCACA CAGCGTGGGA CCAGTTGCCC CTTCTACAAT 720 
f^TC? TG??CGAATT TCATGACACG CGGCTTCGTC TCCAAGACTI GCTGCTGGAG 780 
IrcArGGTGA QTGGGGTAGG GGTGACCAGT GTCCTTCAGA GAAGGGGGGA TGAGAAAGCT 840 
GcSS?^ ScCTTC CCCCAAGGCT TTCCATTCGC AGACCCTCCC CTTTATGGCC 900 
ACcSgATAG GCACCTTCAG GATQQACCTQ QQCATCATCr TGGACCAGCC AGGTATGGAA 960 

T^TTCCCCTT AITOAGACTC TGCACGGACA AGGGCCCTAG AQATTGACCC TGCAGTGACT 1020 
cScATGgIc StACAC TCACTTCGGA GAGGGCCATC TCrrGGCO^AG SCTGAACTCT 080 

^CAOTCC GCCCCTCCCT QCTGAQCCAQ AGAAGCCCTG 6CCATTGTCC GTCACTCCGA 1140 

?^QotScG GCCACCCTGT GCOTCCaSCC GGTCGCCCCT TACCCCTGGC TCGCCX:CTTC 1200 

GCcStMA? ^CCAGTTCT ACCAAAGATG GGTTCCGCTG CATGATCCCC GAGACACCCG 126 0 

CGCCGGGACC ARGGGTTTCA TTAAGGTCAC CTTGTCCGTG AtSGGCGOSCG GGGACCTGCC 1320 

SScC^TG CTACCCCCGG CCCCAGGGCA CTGTTCGQAC ATOGAGAAGT QAGCCGGGGT 1380 

^GCT^GA QGAGGACATG GATCCGGGGG TGGCCGTGGG GCGCGGATAA GGGGAGGGGC 1440 
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CGAGATCCCA GTTTCTCCCC CCCCGCTCGG TGCCCCCTCC CCTAGGAACC TGCTCCTGCC 1500 
GCGCGGGGTG CCCGCCGAGA GGCCATGGGC GCGGCTCCGC GTGCGCCTGT AC03CGCCGA 1560 
GGGGCTTCCC GCGCTGCGCC TGGGGCTGCT GGGCAGCCTG QTCCaCQCCC TGCACGACCA 1620 
GCGCGTCCTG GTGGAGCCCT ATGTGCGGGT GTCTTTCCTG GGGCAGGAGG GCGAGACGTC 1680 
GGTGAGCGCC GAGGCGGCGG CGCCCGAATG GAACGAGCAG CTGAGCTTTCG TGGAGCTCTT 1740 
CCCGCCGCTG ACGCGCAGCC TCCGCCTGCA GCTQCGGGAC GACGCGCCCC TGGTCGACGC 1800 
GGCACTCGCT ACGCACGTGC CGGACCTGAG GCGGATCTCC CATCCGGGCC GCGCGGCGGG 1860 
GTTTAACCCT ACCTTCGGCC CGGCCTGGGT GCCCCTCTAT GGCTCGCCCC CCGGCGCGGG 1920 
GCTCCGGGAT AGTCTTCAAG GTCTCAACGA AGGCGTTGGC CAAGGCATTT GGTTCCGCGG 1980 
CCGCCTTCTG CTGGCTGTGT CCATGCAGGT GTTGGAAGGG AGAGCTGAAC CTGAGCCTCC 2040 
CCAGGCCCAG CAGGGGTCCA CGTTGTCCCG GCTCACCCGA AAGAAGARAA AGAAAGCCAG 2100 
AAGGGATCAG ACCCCAAAGG CGGTTCCGCA GCACTTGGAC GCCAGCCCCG GTGCCSAGGG 2160 
GCCTGAGATC CCCCGTGCCA TGGAGGTGGA GGTGGAGGAG CTGCTQCCGC TQCCAGAGAA 2220 
TGTCCTGGCG CCCTGTGAAG ATTTCCTGCT TTTCGGTGTG CTCTTCGAGG CCACCATGAT 2280 
CGACCCCACC GTGGCCTCCC AGCCCATCAG CTTCGAGATC TCCATTGQTG TGTGGCCTAG 2340 
CCGAACCCCT GAGTGCCATT TCAGACCTTA GAACCCTGGA AGGGGTGTTQ ACTTTCAGTC 2400 
GCGCAGGCCG TCTGGAGGAG CAATTGGGCC GAGGGTCCAG GGCTGGGGAG GGAACTGAGG 2460 
GTGCAGCCGT GGAGGCTCAG CCTCTGCTGG GAGCCAGGCC AGAGGAGGAG AAAGAGGAGG 2 520 
AAGAACTGGG GACCCATGCT CAGCGGCCTG AGCCCATGGA CGGCAGTGGG CCATACTTCT 2580 
3CTTGCCCCT CTGTCACTGC AAGCCATGCA TGCATGTGTG GAGTTGCTGG GAGGACCACA 2640 
CCTGGCGCCT GCAGAGCAGC AACTGCGTQC GCAAAGTGGC CGAGAGGCTG GACCAGGGGC 2700 
TGCAGGAGGT TGAGAGACTG CAGCGCAAGC CGGGGCCTGG CQCCTGTGCA CAGCTCAAGC 2760 
AGGCACTGQA AQTACTGGTG GCTGGGAGCA GACAGTTTTG CCACGGTGCC GAGCGCAGQA 2B20 
CGATGACCCG GCCCAATGCC CTQGATCGAT GCCGAGGGAA ACTCCTGGTG CACAGCCTGA 28B0 
ACCTTTTGGC TAAGCAAGGA CTGCGACITC TACGCGGCCI GAGACQGCGC AATGTGCAAA 2940 
AGAAGGTQQC ACTGGCCAAG AAGCTCCrrGG CAAAACTGOG CTTTCTGGCT GAGGAGGCAC 3000 
CCGGGGCAGC CCCTGGTGAG GTCrGTQCCA AGCTGGAGCT CTTCCTGCGG CTGGGCCTGG 3060 
GCAAGCAAGC CAAGGCCTGC ACCTCTGAGC TGCCCCCGGA TTTGCTGCCC GAGCCCTCAG 3120 
CCGGGCTGCC CTCCAGCCTA CACCGGGACG GTCCTGGAGC AGACGCTGAG CCCTCTGTGG 3180 
GATGAACTCC TGGTATTTGA GCAGTTGATC GTGGATGGGA GGAGGGAGCA CCTGCAGGAG 3240 
GAGCCTCCAT TAGTGATCAT CAATGTATTT GACCACAATA AGTTTGGCCC CCCCGTGTTC 3300 
CTGGGCAGGG CACTQGCCGC CCCAAGGGTA AAGCTGATGG AGGACCCATA CCAACGCCCA 3 36 0 
GAGTTGCAGT TCTTCCCCCT GAGGAAGGGA CCCTGGGCAG CCGGAGAGCT CATTGCOSCC 3420 
TTTCAACTCA TTGAACTAGA CTACAGTGGC CGACTTGAGC CCTCAGTGCC CAGTGAGGTG 348 0 
GAGCCCCAGG ATCTQGCACC CCTGGTTGAG CCCCACTCTG GACGCCTGTC CCTTCCACCC 3540 
AAOSTGTGCC CAGTGCTCAG GQAGTTCCGT GTTGAGGTGC TGTTCTGGGG TCTTAGGGGA 3600 
CTTGGTCGTG TGCATCTGCT CGAGeTGGAG CAGCCCCAOQ TTQTACTGGA GGTGGCTGGG 3660 
CAAGGTGTQQ AGTCTGAGGT CCTGGCCAGC TACCGTGAGA GCCCCAATTT CACTGAGCTT 3720 
GTCAGGCATC TGACAGTGGT CTTCAAAGAC ACAGCTCCTC TCTTCCACX:C CCAGGACTTG 3780 
?CGGAGCAGC CTTACTTGCA GCCTCCACTC AGCATCTTOG TGATTGAGCG COSGGCCTTT 3840 
GGCCACACAG TCCTTGTGGG TTCCCACATT GTCCCCCACA TGCTQCQMT «CATTTOGG 3900 
rr-ITATGAGG ATCCTCCTGA GGAGGAAGGA GAGATGGAGG AGACAGGGGA TATGATQCCC 3960 
RJ.GGGACCTC AAGGACAGAA GTCOCTGGAT CCCTTCTTGG CTGAAGCGGG TATATCCAGA 4020 
CAGC1CCTGA AGCACAACTT TGATGAAGAT GAAATGGATG ATCCTQQAGA TrCAGATOGG 4080 
rrrftflCCTCA TTTCTATGGT TGGGGAGATC CAAGACCAGG GTGAGGCTGA AGTCSAAGGC 4140 
ACT^GTCCC cIIS^GC AGTTGCCACC C«SAAGATOr ACAACAGGTC CCTGAAGG^ 4200 
GAATTTAACC ACTTTGAAGA CTGQCTGAAT GTGTTTCCTC TQTACOSAGG GCAAGGGGGC 4260 
r^r-nrrr-AG GTGGAGAAGA GGAAGGATCT GGACACCTTG TGGGCAAGTT CAAGGQCTCX: 4320 
TTCCTCATT? ACCCTG^TC AGAGGCAGTG TTGTTCTCTG AGCCCCAGAT CTCCCGGGGG 4380 
ATCCCACAGA ACCGGCCCAT CAAGCTCCTG GTCAGAGTGT ATGTTGTAAA GGCTACCAAC 4440 
C?GGCTCCTG CAGACCCCAA TGGCAAAGCA GACCCrrACT I^^Tf.I?^? '^^^ « 
GAGCGGCA6G ACACCAAGGA ACGCTACATC CCC-iAGCAGC TL?J^C.CCAT CTTTGGAGAQ 4560 
ATCCTGGAGC TAAGCATCTC TCTCCCAGCT GAGACGGAGC TGACGGTCGC CGTATTTGAT 1620 
^l^^Zll l^^Zfc ^^TGGcSS gScCAO 4,40 

SgGGCCCAC AGGAGCCATT CTGAGTTTCT GGCCAAACAC ATTCAAGCTC ACATTCCCTT 4800 
TTGTGTCTCC AGATCCTATG ATTTCATGGA AGGGGACCCT CCCACCCACC GCCACTGCCA 4860 
IcCAAGACAT AGCTCAGTGG TCAAGACTTG ==^TTGGGAG TCGG3ATCCT 3™^=^^ f/^" 



TCACTTGAUU uuTTTi-TirT TTTATGAAAC AGTC1 

60 AGTQQCACGA TCTCGGCTGA CTGCAACCTC CACCTCCIGG GTVCAAGCGA TTCTCCTGCC 
TttGCCTCCC CAGTAGCTQG GATTACAGGC GTGGGCCCCC ATGTCCAGCT A.=^TTTTTATA 
Stcgctct GTCTCCCAGG TTGGAGTGCA GTGGCACGAT CTCGGCTGAC TGCAACCTCC 
TTTTU>l.lt.l „„„„„ ^^r,r,^„r'rrr jKrrar.rTr.r.n ATTACAGGCG 



TTTTOSCTCT GTCTCCCACsU TititiA^i^i'-rt ^Lo^jv-rt^^f^x ^i^,^^^^-..- 

ACCTCCTQGG TTCAAGCGAT TCTCCTGCCT CAGCCTCCCC AGTAGCTGGG ATTACAGGCG 5220 

TGGGCCCCCA TGTCCAGCTA ATTTTTATAT TTTTAGTAGA GACAGGGTTT CACCATGTTG 5280 

65 TCCASGCTGG TCTTGAACCC CTGACCTCAA GTGATCCACC CACCTCTGCC TCCCAAAGTG 5340 

CTGGGATTAC AGGTGTGAGC CACCATGCCA GGCCCTCTTA ACCTCTTCAA GTCTGTTTTC b4UU 

TCAtStGCAA AACAGAQGTA ATAAGATCAG TATCTTCTTA ATGGAAGCAC C^SGACTACA 5460 

?TOTTTTCAT TCATTOTTAT CATAAATGAG GACTAACCTG TCTCCCGTTG SGAGTTTTGA 5520 

ACCTAGACCT CATGTCTTCA TGAC^TCATC ACTGCCCCAG GCCCAGCTGT S^CCCTACAC 5580 

70 CAGCCCCaGC TQACQCATCrr TCTTTTTCrG CCTGTAGAGA TGGTTACAAT GCCTGGCGTG 5640 

MgStOTG GCCTTCGCAG ATCCTQQCGG GGCTGTGCCA ACGCTGTGGC CTCCCTGCCC 5700 

CTQAATAcS AGCOSGTGCT GTCAAGGTGG GCAGCAAAGT CTTCCTGACA CCACCGGAGA 5760 

iss =^ s= Js=s isss ssss 

i™ ssss =s sass s=s ssss s 
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GAGGCGGAGT TCCGGCRGCC TGCAGTGCTG GTCCTGCAGG ATCCCTGGAG TTGCAGCTAC 6180 

CAGACATGGT GCGTGGGGCC CGGGGCCCCG AGCTCTGCTC TGTGCAGCTG GCCCGCAATG 6240 

GGGCCGGGCC GAGGTGCAAT CTGTTTCGCT GCCGCCGCCT GAGGGGCTGG TGGCrGGTAG 6300 

TGAAGCTGAA GGAGGCAGAG GACGTGGAGC GGGAGGCGCA GGAGGCTCAG SCTGGCAAGA 6360 

AGAAGCGAAA GCAGAGGAGG AGGAAGGGCC GGCCAGAAGA CCTGGAGTTC ACAGACATGG 6420 

GTGGCAATGT GTACATCCTC ACGGGCAAGG TGGAGGCAGA GTTTGAGCTG CTGACTGTGG 6480 

AGGAGGCCGA GAAACGGCCA GTGGGGAAGG GGQGGAAGCA GCCAGAGCCT CTGGAGAARC 6540 

CCAGCCGCCC CAAAACTTCC TTCAACTGGT TTGTGAACCC GCTGAAGACC TTTGTCTTCT 6600 

TCATCTGGCG CCGGTACTGG CGCACCCTGG TGCTGCTGCT ACTGGTGCTG CTCACCGTCT 6660 

TCCTCCTCCT GGTCTTCTAC ACCATCCCTG GCCAGATCAG CCRGGTCATC TTCCGTCCCC 6720 

3 ACTCTCGCTG ACCTTGGACA CTCACCOWSG GTGCCAACCC TTCAATGCCT 6780 

A GTCTTTCTTA CCCATQTGAG CTACCCCaGR GTCTAGTGCT TCCTCTGAAT 6840 



1 1 I 1 1 I 

MCKRRWHWPR SSWQNCAFWL RRHPGQPLVR SVPSWSSSOG WAWASKPRPA PLSi-VKH-c-r 
SPQPGCPPAY TGTVLEQTLS PLWDBLLVFE QLIVDGRREH LQEEPPLVII MVFDHNKFGP 
PVFLGRALAA PRVKLMBDPY QRPELQFFPL RKGPWAAGEL lAAFQLIELD YSGRLEPSVP 
SEVEPQDLAP MEPHSGRLS LPPNVCPVLR BFRVBVLFWG LRGLGRVHLL EVEQPQWLE 
VAGQGVBSEV LASYRESPNF TELVRHLTW FKDTAPLFHP QDLPEQPYLQ PPLSILVIER 
RAFGHTVLVG SHIVPHMLRF TFRGHEDPPE EEGEMEETGD MMPKGPQGQK SLDPFLAEAG 
ISRQLLKHNF DEDEMDDPGD SDGVNLISMV GBIQDQGBAE VKGTVSPKKA VATLKIYNRS 
LKBEFNHFED WLNVFPLYRG QGGQDGGGEE EGSGHLVGKF KQSFLIYPBS BAVLFSBPQI 
SRGIPQNRPI KLLVRVYWK ATNLAPADPN GKMPYWVS AGRERQDTKE RYIPKQLNPI 
FGEILELSIS LPAETELTVA VFDHDLVGSD DLIGBTHIDL ENRFYSHHRA NOSLASQYEV 
WVQQGPQBPF 

Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession #: NM_002449.2 

Coding sequence: 223.. 1026 

1 11 21 31 41 51 

GG3GGGGGGG GGCAGCCTCT CGGGAAGAGC CAATCAGGQG CGAGCGTCTT CTCGTCGCAC 
GAGGCCCGGC GCGGATTGGC GGCGCBCGTC TC(XACTrCC CCTCGGAGGA AAGGCTCAGC 
TCCCAQCGCG CCCCTCCCGT CTCCGCAGCA AAAAAGTTTG AGTCQCCQCT GCCGGGTTGC 
CAGCGGAGTC GCGCGTCGGG AGCTACGTAG GGCXGAGAAG TCATGGCTTC TCCGTCCAAA 
GGCAAT3ACT TGTTTTCGCC CGACGAGGAG GQCCCaGCAG TGGTGGCCGG ACCAGGCCCG 



QGGCTGGGGG GCGCCGCGGG GGCCGCGGAG GAGCGCCGOG TCAAGQTCTC CAGCCTGCCC 
TTCAGCGTGG AGGCGCTCAT GTCCGACAAG AAGCOSCCCA AGGAGTCGCC CGCTGTGCCT 
CCCGAAGGCG CCTCGGCCGG GGCCCACCTG CGGCCACTGC TGCTGTOSGG GCACCGOSCT 
45 CGGGAAGCGC ACAGCCCCGG GCCGCTGGTG AAGCCCTTCQ AGACCGCCTC SS^CAAGTCG 
GGAAATTCAG AAGATGGAGC GGCGTGGATG CAGGAACCCB GCCGATATTC GCCGCCGCCA 
RGArAl-fiTGA GCCCTACCAC CTGCACCCTG AGGAAACACA AGACCAATCG GAAGCCGCGC 
ACGCCCTTTA CCACATCCCA GCTCCTCGCC CTGGAGCGCA AGTTCOSTCa GAAACAGTAC 
CTCTCCATTG CAGAGCGTGC AGAGTTCTCC AGCTCTCTGA ACCTCACAGA GACCCASGTC 
50 AAAATCTGGT TCCAGAACCG AAGCGCCAAG GCGAAAAGAC TGCAGGAGGC G^CCaaM. 

AAGCTGAAAA TQGCTGCAAA ACCTATGCTA CCCTCCA.GCT ICAGTCTCCC CTTCCCCATC 
AGCTCGCCCC TGCAGGCAGC GTCCATATAC GCAGCATCCT ACCCGTTCCA TAGACCTGTG 
CTTCCCATCC CQCCCGTGGG ACTCTATGCC ACGCCAGTGG HATATGGCAT QTAC.t.At.t.1^ 
TCCTAAGQAA GACCAGATCA ATAGACTCCA TGATGOA.TGC ™TTTCAAA ^^TTCCTC 

ACCCCACCCC CTCTAACGGC TAGCTGACAG GGCCACACCR CATAGCTGAA S 
GTAGGCGGAG GCACCAAGCC CTGCTTTTCT TGGTGT/JvCT TCCAGAGTCC C 
CCCTTGCACA AAAGCTTGGC TCTGATGGTT TTTTTQGCAT GAIGT'"^"-^' ^ 



CCCTTGCACA AAAGCTTGGC TCTGAlt,t.ll 1 1 . i 

AAAAATACTA CAGACKCTTT TTATCAGCAG ACGTAAAAAT TCAAA.TATT TTAAAAGGCA 
(SO AAATTTATAT ACATATQTQC TTTTTTTCTA TATCTCACCT TCCCAAAAAG ACACATGTGT 
^?^TTT GTTGTATTTT CTTAAAGAGG GAGACAAATT CGGAGGAGCG CCGCGTCAAG 
CTCTCCAGCC TGCCCTTCAG CGTGGAGGCG CTCATGTCCG ATTTGCAAAA ATGTGCTAAA 
GTCAATGATT TTTACCQQGA TTATTQACTT CTGCITATAC AAGAAGCCGC CCAAGGAGTC 
GCCCGCTGTG CCTCCCGAAG GCGCCTCGGC CGGCCTGCGG AA.AAAC.oAAA GAAA.^CAGAC 
65 ACRATQCAGC AGCCAGAAAA TATTAGATAT GGAGAGATTA TGGCCACTGC TGCT^CCGG 
CCACGGQGTC CGGGAAGCGC ACAGCCCCGG GCCGCTGGTG ATCAAAGTGA ^CCCACATCA 
TATTTCTGCA TTTTACTTGC ATTAAAAQAA ACCTCTTTAT AAGCCCITCG AGACCGCCTC 
gScAAOTCG GGAAATTCAG AAGATGGAGC GGCGTGGATG CTACATACGT TGTTCCTATC 
TCCCGCCCAC GCCCACACAT ATTTTTAAAG TTTTTAGGAA CCCGGCCGAT ATTCGCCGCC 
70 GCCAAGACAT ATGAGCCCTA CCACCTGCAC CCTGACCTTT TTAAGAATAT TTTTGTAAGA 
CCAMACCTG GGATGAGAAG AATCCGTAGA CTGCCGGRAA CACAAGACCA ATCGGAAGCC 
GOGCACeCCC TTTACCACAT CCCAGCTCCT CGCCCTGGAG STGAGGTAGA AAAATTAGAA 
ATACTTCCTA ATTCTTCTCA AQQCTGTTGG TAACTTTGGA GCGCAAGTTC CGTCAGAAAC 
AGTACCTCTC CATTGCAGAG CGTGCAGAGT TCTCTATTTC AGATAATTGG AGAGTAAAAT 
75 GTTAAAACCT GTGAGASGAT TGTACAGCTC TCTGAACCTC ACAQACCCAG QTCAAAAGGT 
?CTGAGAAAT ACTAGGTACA TTCATCCTCA CAGATTGCAA AGGTGCTTTG GGTGGGGQTT 
TAGTAATTTT CTGCTTAAAA AATGAGTATC TTGTAACCAT TACCTATATC TAAATATTCT 
TGAACAATTA GTAGATCCAG AAAGAAAAAA AAAATATGCT TCTCTGTQTG TGTACCTGTT 

254 
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QTATGTCCTA ACTTATTAGA AAAATTTTAT ATCTTTTTAC ATGTGGGGGG CAGAAGGTAA 2580 

AGCATGTTTG ACTTGTGAAA ATGGGATQTC AAACAGCCAT AAGTTCCCTG GTATTCACCT 2640 

TCCTGTCCAT CTGTCCCCTC CATCGGTATA CCTTTATCCC TTTGAAAGGG TGCTTSTACA 2700 

ATTTGATATA TTTTATTGAA GAGTTATCTC TTATTCTGAA TTAAATTAAG CATTTGTTTT 2760 
5 ATTCTGAATT AAATTAAGCA TTTGTTTTAT TGCAQTAAAG TTTGTCCA 



MASPSKGNDL FSPDEEGPAV VAGPGPGLGG AAGAAEERRV KVSSLEFSVE ALMSDKKPPK SO 

ESPAVPPEGA SAGAHLRPLL LSGHEAREAH SPGPI.VKPFE TASVKSGNSE DGAAWMQEPG 12 0 

RYSPPPRHMS PTTCTIiRKHK TNRKPRTPFT TSQLLAI.ERK FRQKQYLSIA ERAEFSSSLN 18 0 

1 5 LTETQVKIWF QNRSAKAKRL QEAELEKLKM AAKPMLPSSF SLPFPISSPL QAASIYAASY 240 
PFHRPVLPIP PVGLYATPVG YGMYHLS 



25 ATQAQTACAG AAAGAGACTC AGAAACGACa TTTGAOGAGG ATTCTCAGCC TAATGACSAA 60 

GTGGTTCCCT ACAGTGATQA TQAAACAGRA GATGAACTTG ATGACCAGGG GTCTGCTGTT 120 

GAACCAGAAC AAAACOSAGT CAACAGGGRA GCAGAGGAGA ACOSGGAGCC ATTCAGAAAA 180 

GAATQTACAT GGCAAGTCRA AGCAAACGAT CGCAAGTACC ACGAACAACC TCACTTTATG 240 

AACACAAAAT TCTTGTOTAT TAAGGAGAGT AAATATGCGA ATAATGCAAT TAAAACATAC 300 

30 AAGTACAACG CATITACCTT TATACCAATG AATCTGTTTQ AGCAGTTTAA GAGAGCAGCC 360 

AATTTATATT TCCTGGCTCT TCTTATCTTA CAGGCAGTTC CTCAAATCTC TACCCTGGCT 420 

TGGTACACCA CACTAGTGCC CCTQCTTGTG GTGCTGGGOS TCACTGCAAT CRBAGACCTG 480 

GTGGACGATG TGGCTOSCCA TAAAATGGAT AAGGAAATCA ACfiATAGGAC GTGTGAAGTC 540 

ATTAAGGATG GCAGGITCAA AGTTGCTAAG TGGAAAGAAA TTCAAGTrOQ AQACQTCATT 600 

35 CGTCTGAAAA AAAATSATTT TQTTCCAGCT GACATTCTCC TGCTGTCTAG CTCTGAGCCT 660 

AACAGCCTCT GCTATGTGGA AACAGCAGAA CTQQACGQAG AAACCAATTT AAAATTTAAG 720 

ATGTCACTTG AAATCACAGA CCAGTACCTC CAAAGAGAAG ATACATTGGC TACATTTGAT 780 

GGTTTTATTG AATGTGAAGA ACCCAATAAC CGACTAGATA AGTTTACAGG fiACACIATTT B40 

TGGAGAAACA CAAGTTTTCC TTTGGATGCT GATAAAATTT TGTTAOSTGG CTGTSTAATT 900 

40 AGGAACACCG ATTTCTGCCA CGGCTTAGTC ATTTTTGCAG GTGCTQACAC TAAAATAATG 960 

AAGAATAGTG GGAAAACCAG ATTTAAAAGA ACTAAAArTG ATTACTTGAT GAACTACAM 1020 

GTTTACACGA TCTTTGTTGT TCTTATTCTG CTTTCTGCTG CJTCTTGCCAT CGGCCATGCT 1080 

TATTGGGAAG CACAGGTGGG CAATTCCTCT TGGTACCTCT ATQATGGAGA AGAOSATACA 1140 

CCCrCCTACC GTQ3ATTCCT CATTTTCTGG GGCTATATCA TTGTTCTCAA CACCATGQTA 1200 

45 CCCATCTCTC TCTATGTCAG CGTGGAAGTG ATTCGTCTTG GACAGAGTCA CTTCATCAAC 1260 

TGGGACCTGC AAATGTACTA TGCTGAGAAG GACf.CACCCG CAAAAGCTAG AACCACCACft IJ^U 

CTCAATGAAC AGCTCGGGCA GATCCATTAT ATCT.-cnci'G ATA..GACGGG GACACTCACA 1380 

CAAAATATCA TGACCTTTAA AAAGTQCTGT ATC/ACGGGC AGATAIATGG GGACCATOGG 144D 

G^CtSc AACACAACCA CAACAAAATA GAGCAAGTTG ATTTTAGCTG GAATACATAT 1500 

50 GCTGATGGQA AGCTTGCATT TTAT3ACCAC TATCTTATTG AGCAAATCCA GTCAGGGAAA 1560 

GAGCCAGAAG TACGACRGTT CTTCTTCTTG CTCGCAGTTT GCCACACACT CATGGTGGAT 1620 

AGGACTGATG GTCAGCTCAA CTACCAGGCA GCCTCTCCCG ATGAAGGTGC CCTGGTAflflC 1680 

gSqCCAGQA ACTTTGGCTT TGCCTTCCTC GCCAGGACCC AGAACACCAT CACCATCAGT 1740 

GAACTGGGCA CTGAAAGGAC TTACAATGTT CTTGCCATTT TGGACTTCAA CAGTGACCGG 1800 

55 AAGCGAATGT CTATCRTTGT AAGAf.CCCCA GAAGGCAATA TCAAGCTTIA CTQTAAAGQT 1860 

OPTOACACTG TIATTTATGA ACGGTTACAT CGAATGAATC CTACTAAGCA AGAAACACAG 1920 

GMGCCCTGG ATATCTITGC AAATGAAACT CTTAGAACOC TATGCXTTTG CTACAAQGAA 1980 

ATTGAAGAAA AAGAATITAC AGAATGGAAT AAAAAGTTTA TGGCTGCCAG TQTGGCCTCC 2040 

ACCAACCGGG ACGAAGCTCT GGATAAAGTA TATGAGGAGA TTGAAAAAGA CTTAATTCTC 2100 

60 CTGGGAGCTA CAQCTATTGA AGACAAGCTA CAGGATGGAG TTCCAGAAAC CATTTCAAAA 2160 

CTTGCAAAAG CTGACaTTAA QATCTGGGTG CTTACTGGAG ACAAAAAGGA AACTGCTGAA 222 0 

^ATAGGAT TTGCTTGTGA ACTTCTGACT GAAGACACCA CCATCTGCTA TGGGGAGGAT 2280 

SStC TTCTTCATGC AAGGATGGAA AACCAGAGGA ATAGAGGTGG CGTCTACGCA 2340 

^OTIGCAC crCCIGTGCA GGAATCTTTT TTTCCACCCG GTGGA.^CCG TGCCTTAATC 2400 

65 MCACTGGTT CTTGSrTGAA TGAAATTCTT CTCGAGAAAA AQACCMGAG AAATAAGATI 2460 

S^SgI AGTT?cSaG AACAGAAGAA GAAAGACGGA TGCGGACCCA AAGTAAAAGG 2520 

AGGCTAGAAG CTAAGAAAGA QCAGCGGCAG AAAAACTTTG TGGACCTGGC CTG03AGTGC 2580 

AGCGCAGTCa TCTGCTGCCG CGTCACCCCC AAGCAGAAGG CCATGGTGGT GGACCTGGTG 2 64 0 

^Sim AGRAAGCCAT CACGCIGGCC ATCGGAGATG QGGCCP^TGA CGTGAACATG 2700 

70 irCAAAACTG CCCACATTGG CGTTGGAATA AGTGGACAAG MGGAATGCA AGCTGTCATG 2760 

T?SaS AT™gC TCAGTTCCGA TATCTGCAGA GGCTACTGCI GGTGCATGGC 2820 

rviTfSGTCTT ACATAAGGRT GTGCAAGTTC CTRCGATACT TCTOTTACAA AAACTTTGCC 2880 

^tIctt?S TicATTTCTG GTACTCCTTC TTCAATQGCT ACTCTGCGCA GACTGCATAC 2 940 

rifGATTGGT TCATCACCCT CTACAACGTG CTGTACACCA GCOTGCCCGT GCTCCTCATG 3000 

75 GGGCTGCTCG ACCAGGATGT GAGTGACAAA CTGAGCCTCC GATTCCCTOG GTTATACATA 3060 

g?gggaSS g^™ attcractat aagagattct -rrGTRAGcn 3120 

GTCCTAACAT CGATGATCCT CTTCTTCATA CCTCTTGGAG CTTATCTGCA AACOGTRGGG 3180 
iG AGGCACCTTC CGACTACCAG TCTTTTGCCG TCRCCATTGC CTCTGCTCTT 3240 
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GTAATAACAG TCAATTTCCA GATTGGCTTG 
TTTTCAATTT TTGGAAGCAT TGCACTTTAT 
GGftfiTACATG TTCTCTTTCC ATCTGCATTT 
AGACAGCCAT ACATTTGGTT AACTATCRTC 
5 GTTGCCATTC GATTCCTGTC AATGACCATC 
CATCGCAAGC GGTTGAAGGC GGAGGRGCAG 
GGCGTGTCAA CGOGGQSCTC GGCCTACGCC 
ATCTCCTCCG GGOGCAGCAT CCGCAAGAAG 
GGCACOSCGG AGTACAGGCG CACCGGGGAC 



GATACTTCrr ATTGQACTTT TQIGAATGCT 3300 

TTTGGCATCA TSTTTGACTT TCATAGTGCT 3360 

CRATTTACAG GCRCAGCTTC AAACGCTOTG 3420 

CTGACTGTTG CTGTQTGCTT ACTACCCSTC 3480 

TGGCCATCAG AAAGTGATAA GATCCAGAAG 3540 

TGGCRGCGAC GGCAGCAGGT GTTCCGCCGG 3600 

TTCTCGCACC AGCGGGGCTA CQOGGACCTC 3660 

CGCTCGCCGC TTGATGCCAT CGTGGCGGAT 3720 
AGCTGA 



WbTKKUbKTT FDEDSQPNDE WPYSDDETE DELDDQGSAV EPEQNRWRE AEENRBFFRX 
ECTWQVKAND RKVHEQPHFM NTKFLCIKES KYANNAIKTY KYUAFTFIFM NLFEQFKPJiA 
NLYFLALLIL QRVPQISTIjA WYTTLVPLLV VLGVTAIKDL VDDVARHIv--a3 KEIIWRTCEV 
IKDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSEP KSLCY\'ETAE LDQETNLKFK 
MSLBITDQYL QREDTLATFD GFIECEEPNN RLDKFTGTLF WRNTSFPLDA DKILLRGCVI 
RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TKIDYLMNYM VYTIFWLIL LSAGLAIGHA 
YWEAQVGNSS WVLYDGEDDT PSYRGFblFW GYIIVLNTMV PISLYVSVEV IRLQQSHFIN 
WDLQHYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLT QNIMTFKKCC INGQIYGDHR 
DASQHNHNKI EQVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFFFL LAVCHTVMVD 
RTDGQLNYQA ASPDEGALVN AAHNFGFAFL ARTQNTITIS ELGIERTYNV LAILDFNSDR 
KHMSIIVRTP BGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIFANE" LRTLCLCYKE 
lEEKEFTEWN KKFMAASVAS TNRDEALDKV YEEIEKDLIL LGATAIEDKL QDGVPETISK 
LAKADIKIWV LTGDKKETAE HIGFACELLT BDTTICYGED IN3LLHARME NQRNRGGVYA 
KFAPPVQESF FPPGGNEAI.I ITGSWLNEIL LEKKTKRNKI LKLKFPSTEE ERRMRTQSKE 
RLEAKKEQRQ KNFVDLACEC SAVICCSVTP KQKAIWVDLV KRYKKAITLA IGDGANDVNM 
IKTSHIGVGI SGQEGMQAVM SSDYSFAQFR YLQRLLLVHG RWSYIRMCKF LRYFFYKNFA 
FTLVHFWYSF FNQYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRFPGLYI 
VGQRDLLFNY KRFFVSLLHG VI/TSMILFFI PLGAYLQTVG QDGEAPSDYQ SFAVTIASAL 
VITVNFQIGL DTSYWTFVNA FSIFGSIALY FGIMFDFHSA GIHVLFPSAP QfTGTASNAL 114U 
RQPYIWLTII LTVAVCLLPV VMRFLSMTI WPSESDKIQK HRKRLKREEQ WQRRQQVFRR 1200 
GVSTRRSAYA FSHQRGYftDL ISSQRSIRKK ESPLDAIVAD GTAEYRRTGD S 



Seq ID NO: 71 DNA sequence 
Nucleic Add Accession NM. 



1140 



1 I I I 1 I 

QQGGATACGG AAAACAGAAT AGAAGCACAG AACfiAGGACT ATTQTATATA AGAGAAGCAG 60 

GTGTTAATGT AAGTTGCTGT GGAATGACTG TTTTCAAGAA GAGAGftAGGA GCCAATATAT 120 

CTACTTTCAT -[-AAGAAGCTG AAATTAAATT TGGATTTCCT TCTTTCCAAA GCTGAGACAC 180 

GATGGCCAAA AAGCGATTGC TGTGATTGGA GCTGGAATTA GOKACTGGG GGCCATCAAG 240 

TGCTGCCTGG ATGAAGATCT GGAGCCCACC TGCTTTGAAA GAAATGATGA TATTQ^CAT 300 

CTCTGGAAAT TTCAAAAAA-^i TACTTCAGAG AAAATGCCTA GTATCTACAA ATCTGTGACC 360 

ATCAATACTT CCAAGGAGAT GATGTGCTTC AGTGACTTCC CTGTCCCTGA TCATTTTCCC 420 

aArraCATGC ACAACTCCAA ACTCATGGAC TACTTCGGGA TGTATGCCAC ACACTTTGGC 480 

^CCTG^?? AC^TCG??? tSgACTGAA GTGCAAAGTG TGAGGAAGCA CCCAGATTTT 540 

TCTATCAATG GACAATGGGA TGTTGTTGTG GAGACTGAAQ AGAAACAAGA SACTTTGGTC 600 

TTTGATQGQG TCTTAGTTTG CAGTGGACAC CACACAGATC CCTACTTACC ACTTCAGTCC 660 

TTCCCAGGCA TTGAGAAATT TGAA.GGCTCT TATTTCCATA GTCGGGAATA CAAAAGTCCC 720 

GAGGACTTTT CAGGGAAAAG AATCATAGTG ATCGGCATTG GAAATTCTGG AGTGGATATT 780 

GCQQTGGAGC TCAGTCGTGT AGCAAAACAG ATTTGACCAG TAGGAACTGA GATTCSAOGT 840 

CCCCAAGACT GCAAAAGTAC CTACAATCAA ACAAATTAAT ATATTTCCTA ATTGGATCAG 900 

CATGCAGACA AGTTTACA.'iA TATATTACAG AAAGATTTGA AGAGGGATTC TGGTTACGTT 960 

CAGTTAAGCA GGACACTAAT CCTGAACAAT CIGATGTATT AAAGTTTCTA ^TGTTTGTAT 1020 

CTTTGGnTG ATGCATACAA TAGAGTGGAA GTCTGTAGTA GTTAATATGA TGATAATTCG 1080 

CTTCTAAATT TAAATCCTTA TTTATATATG AAGCATAGCT ATGTCAATGT CCAGGGAAGC 1140 

cIsIIImG ™^tSa S^S^ A^AAAT CAGTGGGCAG CTCTQQTCAA 1260 

GCACAGCATG GCGACCACTG TGTAGTATTT TACCACCTAA TCCTTAGTAG CCAACACTGT 1320 

TAAATCTGCA TAATGGGTAG CATATGAATA AAATftATATC TTGTAAAAAA AAAAAAAAAA 1380 
AAAAAAAAAA AAAA 



Seq ID HOs 72 Protein sequence: 



75 MPSIYKSVTl HTSKEM4CFS DFPVPDHFPN YMHNSKLMDY FGMYATHFGL LNYIRFKTEV 

QSVRKHPDFS INGQWDVWE TEEKQETTLVF DGVLVCSGHH TDPYLPLQSF PGIEKFEGCY 
FHSREYKSPE DFSGKRIIVI GIGNSGVDIA VELSRVAKQI 
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I I 1 

ATGAGTGCAC TTTTCCTTGG TGTGGGAGTG A 

C CAAGTGGGAC AGATACTQGA GATCCTCAA.^^ G 

rCGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 
AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGR C7GATAAT3A 
GGATTCGTGG CTGCTGCTGR ACTGCCCAGG AATGAGGCAG ATGAGCTCCG 
GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA A 
TACAGAAACT GGTTTCTGAA AGAGTTTCCT CGGTTGAAAA GTGAGCTTC-A GC-ATAACATA 
AGAAGGCTCC GTGCCCTTGC AGATGGQGTT CAGAAGGTCC ACAAAGGCAC CACCATCGCC 
AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CQGCATGGGT 
CTGGCACCCT TCACAGAGGG AGGCAGCCTT GTACTCTT3G AACCTGGGAT GQAGTTQGGA 
ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGQ ACTACGG.^AA GAAGTQGTGG 
ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 
GAGTTTTTGG GTGA3AACAT ATCCAACTTT CTTTCCTTAG CT3GCAATAC TTACCAACTC 
ACACQAQGCA TTGGGAAGGA CATCCGTGCC -CTCAGACaAG CCAGAGGCAA TCTTCAGTCA 
GTACCGCaTG CCTCRGCCTC AOSCCCCaSG GTCACTGAGC CAATCTCAGC t-"--"— 
GRACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA T 
ACOQATarOG CCCCTGTAAG CTTCTTTCTT GTGCTGGRTG TAGTCTACCT C 
TCAAAGCACT TACATGAGGQ GGCAAAQTCA GAGACAGCTG AGGAGCTGAA G 
CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ft— " 
CAAGAACTQT GA 

Seq ID NO! 74 Protein Sequen 



MSALFLGVGV EABEAQARVQ QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIBDAIKYF 
KEKVSTQNLL LLLTDNEAVW GFVAAAELPR NBADBLRKAL DNLARQMIMK DKNWHDKGQQ 
YRM'^JFLKEFP RLKSELEDMI RRLRALADGV QKVHKGTTIA NWSQSLSIS SGILTLVGMG 
LAPFTEGGSL VLLBPGWELG ITAALTGITS STMDYGKKWW TQAQAHDLVI KSLDKLKBVR 
EFLGEMISNF LSLAGNTYQL TRGIGKDIRA LREABANLQS VPHASASRPH VTEPISAESG 
EQVERVNEPS ILEMSRGVKL TDVAPVSFFL VLDWYLVYB SKHLHBGAKS ETAEELKKVA 
QBLEEKLNIL NNNYKILQAD QEL 

Seg ID NOs 75 DMA sequence 

Nucleic Acid Accession #= NM_014452.1 

Coding sequence: 1..196a 

1 Y T T r T 

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCX:GCCGA 

GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CAC^CCACA 

GCTCAGCCAG AACAGAAGGC CTCGAA.TCTC ATTGGCACAT ACCGCCATGT TGACCQTGCC 

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 

ACCAACACAA GCCTGCGCGT CTGCRGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA 

TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 
AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGfGTGf =AAGAAAGGG 



AACGCTACCT GTGCCCCCCA TACGGTl^l.t=l 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT 
TCTASrCTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 
AAGCOSGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCICC 
ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 
rcr.ioTrraA rTCTTCTGCC 



ACCTCACCTT CCCCTGGCAC AGCCATCTTT cc 11= «u^n^io-r. 
GTCQCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 
TCTSITAGAC CAAAGGTACT GAGTAQCATC CAGGAAGGGA CAGTCCCTGA CAflCACAAGC 
TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTICAGGT AGTCAACCAC 
CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACIGGG 
GGCGAGAAGT CCAGCACGCC CATCAAOQGC CCCAAGAGGG OACATCCTAG AGAGAACCTA 
CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA TTGTGCTTTT CCTGCIGCTG 
GTGCTTGTGQ TQATTGTGGT GTGCAGTATC CGGAAAAGCT CGA3GACTCT GAAAAAGGGG 
CCCCGGCAGG ATCCCAQTGC CATTQTGGAA AAGGCAGGGC TaM^ShAATC CATGACTCCA 
AOCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC ATGGTATCGA TATCCTGAAG 
CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT ATCA.GTTTCT TTGC.AA.TGCC 
AGTGAGAGGG AGQTTQCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA 3CGGGCCTAC 
GCAGCTCTGC AGCACTGGAC CATCCOaOQC CCOGAGGCCA GCCTCGCCCA 3CTAATTAGC 
GCCCTGCGCC AGCACCGGRG AAACGATGTT GTGGAGAAGA TTCGTGGGCT 3ATG3AAGAC 
ACCACCCAGC TQQAAACTGA CAAACTAGCT CTCCCGATGA GCCCCA3CCC GCTTAGCCCG 
AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCrCCTGAC GGTGGAGCCT 
TCCCCACAGG ACAAGAACAA CSGGCTTCTTC GTSGATGAOT CGGAGCCCCT TCTCCGCTGT 
GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 
AAGAAGGACA CAGTGTTQCQ GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 
GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGOQ TGATTGAAGA GATTCCCCAG 
GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTQGAG TCAAQAQCCA QOAAGCCAGC 
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CAGRCCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTCTAG 



111111 
MQTSPSSSTA LA3CSRIARR ATATMIAG— ' 
TGQVLTCDKC PAGTYVSEHC TNTSLRVC 
LPCAALTDRE CTCPPGMFQS NATCAPHT 
SSVMKCKAYT DCLSQNLWI KPGTKETDNV CGTLPSFSSS TSPSPGTAIF PRPEHMETHE 
VPSSTYVPKG MNSTESNSSA SVRPKVLSSI QEGTVPDNTS SARGKEDVNK TLPNLQWNH 
QQGPHHRHIL KLLPSMEATG GEKSSTPIKG PKRGHPRQNL HKHFDINEHL PMMIVLFLLL 
VLWIWCSI RKSSRTLKKG PRQDPSAIVE KAGLKKSMTP TQNREKWIYY CNGHGIDILK 
LVAAQVGSQW KDIYQFLCNA SERBVAAFSH GYTADHBRAY AALQHWTIRG PERSLAQLIS 
ALRQHRENDV VEKIRGLMED TTQLETDKI.A LPMSPSPLSP SPIPSPNAKL ENSALLTVBP 
SPQDKNKGFF VDESEPLLRC DSTSSGSSAL SHNGSFITKE KKDTVLEQVR LDPOJLQPIF 
3 ELRVIEEIPQ ABDKLDRLFB IIGVKSQBAS QTLLDSVYSH LPDLL 



GGCGCGGGCG GCCTGGAGCC CCGGGAGCGG QSOSCGCGGT CCCGGCCCAG CCGGCTCTCC 60 

TGGCCTCGCG CTGCACaTTC TCTCCTGGCG GOSGCGCCAC CTGCAGTAGC GTTCGCCCGA 120 

ACATGGCGAC ACGGAGCAGC RGGAGGGAQT OSCGACTCCC GTTCCTATTC ACCCTGGTCG 180 

CACTGCTGCC GCCCGGAGCT CTCTGCGAftG TCTGGACGCA GAGGCTGCAC GGCGGCAGCG 240 

30 CQCCCTTQCC CCAGGRCCGG GGCTTCCTCG TGGTGCaGGG CGACCCGCGC GAGCTGCGGC 300 

TGT3GGCGCG OSGGGATGCC AGGQGGGCGA GCCGCGCGGR CGAGAAGCCG CTCOSGAGGA 360 

AACGGAGCGC TGCCCIGCAG CCCGRGCCCA TCRAGGTGTA CGGACAGGTT AGTCTGAATG 420 

ATTCCCACAA TCMATGGTG GTGCACTGGG OTGGAGAGAA AAGCAAOGTG RTOGTGGCCT 480 

TGGCCCGAGA TAGCCTGGCA TTGGCGAGGC CCAAGRGCAG TGATGTGTAC GTGTCITACG 540 

35 ACTATGGAAA ATCATTCAAG AAAATTTCAG ACftAGTTAAA CTTTGGOTTG GGAAATAGGA 600 

QTGAAGCTGT TATCGCCCAG TTCTACCACA GCCCTGCGGA CAACAAGOSG TACATCTTTQ 660 

CAGACGCTTA TGCCCAGTAC CTCTGGATCA CGTTTGACTT CTGCAACACT CTTCMGGCT 720 

TTTCCATCCC ATTTCGGGCR GCTGATCTCC TCCTACACAG TRAGGCCTCC AACCTICTCT 780 

T3QGCTTTGA CAGGTCCCAC CCCAACAAGC AGCTGTGGAA GTCAGATGAC TTTGGCCAQA 840 

40 CCTGGATCAT GATTCAGGAA CATGTCAAGT CCTTTTCTTG GGGAATIGAT CCCTATGACa 900 

AACCAAATAC CATCTACATT GAACGACACG AACCCTCTGG CTACTCCACT GTCTTCCGAA 960 

GTACAGATTT CTTCCAGTCC CGGGAAAACC AGGAAGTGAT CCTTGAGGAA GTGAGAGATT 1020 

TTCAGCTTCG GGACAAGTAC ATGTTTGCTA CAAAGOTGGT GCATCTCTTG GGCaGTGAAC 1080 

AGCAGTCTTC TGTCCAGCIC TGGGTCTCCT TTGGCCGGAA SCCCRTGAGA GCAGCCCAGT 1140 

45 TTGTCACAAG ACATCCTATT AATGAATATT ACATCGCAGA TGCCTCCGAG GACXaGGTGT 1200 

TTGTGTQTQT CAGCCACAGT AACAACCGCA CCAATTTATA CATCTCAGAG GCAQAQGGGC 1260 

TGAAGTTCTC CCTGTCCTTG GAGAACGTGC TCTATTACAG CCCAGGAGGG GCOQGCAGTG 1320 

ACACCTTGGT GAGGTATTTT GCAAATGAAC CATTTGCTGA CTTCCACCGA GTGGAAGGAT 1380 

TQCAAGGAGT CTACATTGCT ACTCTGATTA ATGGTTCTAT GAATGAGGAG AACATGAGAT 1440 

50 CGGTCATCAC CTTTGACAAA GGGGGAACCT GGGAGTTTCT TCAGGCTCm GCCTTCACGG 1500 

GATATGGAGA GAAAATCAAT TGTGAGCTTT CCCAGGGCTG TTCCCTTCAT CTGGCTCAGC 1560 
GCCTCAGTCA GCTCCTCAAC CICCAGCTCC GGAGAATGCC CATCCTGTCC 
CTCCAGGCCT CATCATCGCC ACTGGCTCAG TGGGAAAGAA CTTGGCTAGC 
TGTACATCTC TAGCAGTGCT GGAGCCAGGT GGCGAGAGGC ACTTCCTGGA 
55 ACACATQQGG AGACCACGGC GGAATCATCA CGGCCATTGC CCAGGGCATG 

AGCTAAAATA CAGTACCAAT GAAGGGGAGA CCIGGAAAAC ATTCATCTTC T.TGAGAAGC 1360 
CAGTGTTTGT QTATGGCCTC CTCACAGAAC CTGGGGAGAA GAGCACTGTC TICACCATCT 1920 
TTGGCTCGAA CAAAGAGAAT GTCCACAGCT GGCTGATCCI CCAGGTCAA.T GCCACGGATG lyBU 
CCTTGGGAGT TCCCTGCACa GAGAATGACT ACAAGCTGTQ GTCACCATCT GATGAGCGGG 2 040 
60 GGAATGAGTG TTTGCTGGGA CACAAGACTG TTTTCAAACG GCGGACCCCC CATGCCACAT 2100 

GCTTCAATGG AGAGGACTTT GACAGGCCGG TGGTCQTGTC CAACTGCTCC TGCACCCGGG 216 0 
AGGACTATGA GTGTGAOTTC GGTTTCAAGA TGAGTGAAGA TTTGTCATTA GAGGTTTGTG 2220 
TTCCAGATCC GGAATTTTCT GGAAAGTCAT ACTCCCCTCC TGTGCCITGC CCTGTGGGTT 2280 
CTACTTACAG GAGAACGAGA GGCTACCGGA AQATTTCTGG GGACACTTGT AGCGGAGGAG 234 0 
65 ATGTTGAAGC GCGACTGGAA GGAGAGCTGG TCCCCTGTCC CCTGGCAGAA GAGAACGAGT 2400 

TCATTCTGTA TQCTarOAGG AAATCCATCT ACCGCTATGA CCTGGCCTCXS GGAGCCACCG 24SC 
AGCAGTTGCC TCTCRCCGGG CTACGQQCAG CAGTGGCCCT GGACTTTGAC TATGAGCACA 2521 
ACTGTrrGTA TTGGICCGAC CTGGCCTTGG ACGTCATCCA QCQCCTCTGT TTGAATGGAA 2 58 
GCACAGGGCA AQAGGTGATC ATCAATTCTG GCCTGGAGAC AGTAGAAGCT TTGGCTTTTG 264C 
70 AACCCCTCAG CCASCTGC3T TACTGGGTAG ATGCAGGCTT CAAAAAGATT GAGGTAGCTA 270C 
ATCCAGATGG CGACTTCOSA CTCACAATCG TCAATTCCTC TGTGCITGAT CGTCCCAGGG 27 6 C 
CTCTGGTCCT CGTGCCCCSiA GAGGGGGTGA TGTTCTGGAC AGACTGGGGA GACCTGAAGC 28 2 C 
CTOGOATTTA TCGGAQCAAT ATGGATGGTT OTGCTGCCTA TCACCIGGTG TCTGAGGATG 2881 
TGAAGTGGCC CAAIGGCATC TCTGTGGACG ACCRGTGGAT TTACTGGACG GATGCCTACC 2 94 
75 TGGAGTGCAT AGAGCGGATC AOSTTCaGTG GCCAGCAGCG CTCTQTCATT CTGGACAACC 3 00 

TC-CCGCACCC CTATGCCaTT GCTGTCTTTR AGAATGAAAT CTACTGGGAT GACIGGTCAC 306 
AGCTCAGCAT ATTCCGA3CT TCCAAATACA GTGGGTCCCA GATGGAGATT CTGGCAAACC 312( 
AGCTCACGGG GCTCATGGAC ATGAAGATTT TCTACAAGGQ GAAQAACACT GGAAGCAATG 3ia( 
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^. , CTGCTGT GCCTGCCCAA GGCCAACAAC AGTAGASGCM 3240 

GCAGGTGTCC AGAGGATGTG TCCAGCAGTG TGCTTCCATC AGGGGACCTG ATGTGTGACT 33 00 

:A TCGCTGCAGC AACGGGAACT GTATCAACAG CATTTGGTGG TGTGACTTTG 3420 

:G TGGAGACATG AGCGATQAGA GAAACTGCCC TACCACCATC TGTGACCTGG 3480 

A^A^^^./iL=i'T TCGTTGCCAG GAGTCTGGGA CTTGTATCCC ACTGTCCTAT AAATQTGACC 3540 

TTGAGGATGA CTGTGGAGAC AACAGTGATG AAAGTCATTG TGAAATGCAC CAGTGCCGGA 3600 

GTGACGAGTA CAACTGCAGT TCCGGCATGT QCATCCGCTC CTCCTGGGTA TGTGACGGGG 3660 

ACAACGACTG CAGGGACTGG TCTGATGAAG CCAACTGTAC CGCCATCTAT CACACCTGTG 3 72 0 

AGGCCTCCAA CTTCCAGTGC CGAAACGGGC ACTGCATCCC CCAGCGGTGG GCGTGTGACG 3760 

GGGATACGGA CTGCCAGGAT GGTTCCGATG AGGATCCAGT CAACTGTGAG AAGAAGTGCA 3840 

ATGGATTCCG CTGCCCAARC GGCACTTGCA TCCCATCCAG CAAACATTQT GATGGTCTGC 39 00 

GTGATTGCTC TGATGGCTCC GATGAACAGC ACTGCGAGCC CCTCTGTACG CACTTCATGG 3960 

ACTTTGTGTG TAAGAACCGC CAGCAGTGCC TGTTCCACTC CATGGTCTGT GACGGAATCA 40 20 

TCCAGTGCCG CGACGGGTCC GATGAGGATG CGGCGTTTGC AGGATGCTCC CAAGATCCTG 4080 

AGTTCCACAA GGTATGTGAT GAGTTCGGTT TCCAGTGTCA GAATGGAGTG TGCATCAGTT 4140 

TGATTTGGAA GTGCGACGGG ATGGATGATT GCGGCGATTA TTCTGATGAA GCCAACTGCG 4200 

AAAACCCCAC AGAAGCCCCA AACTGCTCCC GCTACTTCCA GTTTCGGTGT GAGAATGGCC 42S0 

ACTGCATCCC CAACAGATGG AAATGTGACA GGGAGAACGA CTGTGGGGAC TGGTCTGATG 4320 

AGAAGGRTTG TGGAGATTCA CATATTCTTC CCTTCTCGAC TCCTGGGCCC TCCACGTGTC 4 3 80 

TGCCCAATTA CTACCGCTGC AGCAGTGGGR CCTGCGTGAT GGACACCTGG GTGTGCGACG 4440 

QGTACCGAGA TTGTGCAGAT GGCTCTTGACG AGGRAGCCTG CCCCTTGCTT GCAAACGTCA 4500 

CTGCTGCCTC CACTCCCACC CAACTTGGGC GATGTGACCG ATTTGAGTTC GAATGCCACC 4560 

AACCGAAGAC GTGTATTCCC AACTGGAAGC GCTGTGACGG CCACCAAGAT TGCCAGGATG 4620 

QCCQQQRCGA GGCCAATTGC CCCACACACR GCACCTTGAC TTGCATGAGC AGGGAGTTCC 4680 

AGTGCGAGGA CGGGGAQGCC TQCATTQTGC TCTCGGAGCG CTGCGAOGGC TTCCTGGACT 4740 

GCTCGGACGA GAGCGATGAA AAGGCCTGCA GTGATGAGTT QACTQTQTAC AAAGTACAGA 4800 

ATCTTCAGTG QACAGCTGAC TTCTCTGGGG ATGTGACTTT GACCTGGATG AGGCCCAAAA 4860 

AARTGCCCTC TGCTTCTTGT QTATATAATQ TCTACTACAG GGTGGTTGGR GAGAGCATAT 4920 

GGAAGACTCT GGAGACCCAC AGCAATAAGA CASACACIGT ATTAAAAQTC TTGAAACCAG 4980 

ATACCACGTA TCAGGTTAAA GTACAGQTTC ASIGTCTCAG CAAGGCACAC AACACCAATG 5040 

RCTTTGTGRC CCTGAGGACC CCAQAGGGAT TGCCAGATGC CCCTCGAAAT CTCCAGCTGT 5100 

CRCTCCCCRG GGAAGCAGAA GGTGTGATTG TAGGCCACTG QQCTCCTCCC ATCCACACCC 5160 

ATGGCCTCRT CCGTGAGTAC ATTGTAGAAT ACAGCAGGAG TGGTTCCAAG ATGTGGQCCT 5220 

crcAGAGGGC TGCTAQTAAC TTTACAQAAA TCAAGRACTT ATTGGTCAAC ACTCTATACA 5280 

CCGTCAGRGT GGCTGCGGTG ACTAGTCGTG GAATAGGAAA CTQGAGCGAT TCTAAATCCA 5340 

TTACCACCAT AAAAGGAAAA GTGATCCCAC CACCAGATAT CCACATTGAC AGCTATQGTG 5400 

AAAATTATCT ARGCTTCACC CTGACCATGG AGAGTGATAT CAAGGTGAAT GGCTATGTGG 5460 

TGAACCTTTT CTGGGCATTT GACACCCACA AGCAAGAGAG GAGAACTTTG AACTTCCGAG 5520 

GAAGCATATT GTCACACARA GTTGGCAATC TGACASCTCA TACATCCTAT GAQATTTCTG 5580 

CCTGGGCCAA GACTGACTTG GGGGATAGCC CTCTGGCATT TGAGCATGTT ATGACCAGAG 5640 

GGGTTCGCCC ACCTGCACCT AGCCTCAAGG CCAA.RGCCAT CAACCAGACT GCAGTGGAAT 5700 

GTACCTGGAC CGGCCCCCGG AATGTGGTTT ATGGTATTTT CTATQCCACG TCCTTTCTTG 5760 

ACCTCTATCG CAACCCGAAG AGCTTGACTA CTTCACTCCA CAACAAGACG GTCATTQTCA 5820 

GTAAGGRTGA GCAGTATTTG TTTCTGGTCC GTGTAGTGGT ACCCTACCAG GGGCCATCCT 5880 

CTGACTACGT TGTAGTGAAG ATGATCCCGG ACAGCAGGCT TCCACCCCGT CACCTGCATG 5940 

TGGTTCRTAC GGGCAAAACC TCCGTGGTCA TCAAGTGGGA ATCACCGTAT GACTCTCCTQ 6000 

ACCAGGACTT GTTGTATGCA ATTGCAGTCA AAGATCTCAT AAGAAAGACT GACAGGAGCT 6060 

ACAAAGTAAA ATCCCGTAAC AGCACTGTGG AATACACCCT TAACAAGTTG GAGCCTGGCG 6120 

GGAAATACCA CATCATTGTC CAACTGGGGA ACATGAGCAA AGATTCCAGC ATAAAAATTA 6180 

CCACAGTTTC ATTATCAQCA CCTGATGCCT TAAAAATCAT AACAGAAAAT GATCATGTTC 6240 

TTCTGTTTTG OAAAAGCCTG GCTTTAAAGG AAAAGCATTT TAATGAAAGC AGGGGCTATG 6300 

AGATACACAT GTTTGATAGT GCCATGAATA TCACAGCTTA CCTTOGGAAT ACTACTGACA 6360 

ATITCTTTAA AATTTCCAAC CTGAAGATGG GTCATAATTA CACGTTCACC GTCCAAGCAA 6420 

GATGCCTTTT TGGCAACCAG ATCTGTGGGG AGCCTGCCAT CCTGCTGTAC GATGAGCTGG 6480 

GGTCTGGTGC AGATGCATCT GCAACGCAGG CTGCCAGATC TACGOATGTT GCTGCTGTGG 6540 

TGGTGCCCAT CTTATTCCTG ATACTGCTGA GCCTGGGGGT GGGGTTTGCC ATCCTGTACA 6 600 

CGAAGCACCQ GAQGCTGCAG AGCAGCTTCA CCGCCTTOGC CAACAGCCAC TACAGCTCCA 6660 

GGCTGGGGTC CGCAATCTTC TCCTCTGGGG ATGACCTGQG GQAAGATGAT GAAGAIGCCC S720 

CTATQATAAC TQGATTTTCA GATGACGTCC CCATGGTGAT AGCCTGAAAG AQCTTTCCTC 6780 

ACTAGAAACC AAATGGTGTA AATATTTTAT TTGATAAAGA TAGTTGATGG TTTATTTTAA 6840 

AAGATGCACT TTGAGTTGCA ATATGTTATT TTTATATQQG CCAAAAACAA AftAACAAAAA 6900 
AAAAAA 



iLtRSSRRES LpFLFTLVA LLEPGALCEV WTQRLHGGSA kpQDRGFLV VQGDPRELRL 



MATRSSRRES RLPFLFTLVA LLPPGALCEV W1VKU11>>»»" i-i.i-uuKoiri.v .vv.--—"-" 
WARGDARGAS HADEKPLRRK RSAALQPEPI KVYGQVSLND SHNQMWHVfA GEKSNVIVAL 
ARDSLALARP KSSDVYVSYD YGKSFKKISD KLMFGLGNRS EAVIAQFYHS PADNKEYIFA 



ARDSLALARP KSSDVYVSYD YGKSFKKISD KIiNRjlASHKa cnvinyrino ■rr^L-.»^.w^,.„ 
DAYAQYLWJT FDFQJTLQGF SIPFRAADLL LHSKASNLLL GFDESHPNKQ LWKSDDFGQT 
WIMIQEHVKS FSWGIDPYDK PNTIYIERHE PSGYSTVFRS TDFFQSRENQ EVILEEVRDF 
OLRDKYMFAT KWHLLGSEQ QSSVQLWVSF GRKPMRAAQF VTRHPINEYY lADASEDQVF 
VCVSH3NNRT NLYISEAEGL KFSLSLEHVL YYSPGGAGSD TLVRYFAHEP FADFHRVEGL 
QGVYIATLIN GSMNEENMRS VITFDKGQTW EFLQAPAFTG YGEKINCELS QGCSLHLAQR 
LSQLLMLQLR RMPILSKESA PGLIIATGSV GKNLASKTNV YISSSAQARW REALPGPHYY 
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TWGDHGGIIT AIAQQMBTNB LKYSTNBGBT WKTFIFSEKP VFVYGLLTEP GEKSTVFTIF 
GSNKENVHSW LILQVNATDA LGVPCTBNDY KLWSPSDERG NECLLGHKTV FKRRTPHRTC 
FNGEDFDRPV WSNCSCTRE DYECDFGFKM SEDLSLEVCV PDPEFSGKSY SPPVPCPVGS 
TYRRTRGYEK ISGDTCSGGD VBARLEGBLV PCPLAEENEF ILYRVRKSIY RYDLASGATE 
QLPLTGLRAA VALDFDYEHN CLYWSDLALD VIQRLCLNGS TGQEVIIHSG LETVEALAFE 
PLSQLLYWVD AGFKKIEVAN PDGDFRLTIV NSSVLDRPRA LVLVPQEGVM FWTDWGDLKP 
3 AAYHLVSEDV KWPNGISVDD QKIVMTDAYL ECIERITFSQ QQRSVILDNL 
K NEIYWDDWSQ LSIFRASKYS GSQM3ILANQ LTGLMDMKIF YKGKNTGSMA 
C LPKANNSRSC RCPEDVSSSV LPSGDLHCDC PQGYQLKNNT CVKEENTCLR 
NQYRCSNQNC INSIWWCDFD NDCGDMSDER NCPTTICDLD TQFRCQE3GT CIPLSYKCDL 
EDDOSDNSDE SHCEMHQCRS DEYNCSSGMC IRSSWVCDGD NDCRDWSDEA NCTAIYHTCE 
ASNFQCRNGH CIPQRWACDG DTDCQDGSDE DPVNCEKKCN GFRCPMGTCI PSSKHCDGLR 
DCSDGSDEQH CEPLCTHFMD FVCKNRQQCL FHSMVCDGII QCRDGSDEDA AFAGCSQDPE 
FHKVCDBFGF QCQNGVCISL IWKCDGMDDC GDYSDEANCE NPTEAPNCSR YFQFRCEMGH 
CIPNRWKCDR ENDCGDWSDE KDCGDSHILP FSTPGPSTCL PNYYRCSSGT CVMDTWVCDG 
YRDCADGSDE BRCPLLSNVT AASTPTQLGR CDRFEFECHQ PKTCIPNWKR CDGHQDCQDG 
RDEANCPTHS TLTCHSRBFQ CBDGEACIVL SERCDGFLDC SDESDEKACS DELTVYKVQM 
LQKTADFSGD VTLTWMRPKK MPSASCVYNV YYRWGESIW KTLETHSNKT NTVLKVLKPD 
TTYQVKVQVQ CLSKAHNTHD FVTLRTPEGL PDAPHNLQI.S LPREAEGVIV GHWAPPIHTH 
GLIREYIVBY SRSGSKMWAS QRAASNFTEI KNLLVNTLYT VRVAAVTSRG IGNWSDSKSI 
TTIKGKWIPP PDIHIDSYGB NYLSFTLTMB SDIKVNGYW NLFWAFDTHK QERRTLNFRG 
SILSHKVGNL TAHTSYEISA WAKTDLGDSP LAFEHVMTRG VRPPAPSLKA KAINQTAVEC 
TWTGPRNWY GIFYATSFLD LYRNPKSLTT SLHNKTVIVS KDEQYLFLVR VWPYQGPSS 
DYWVKMIPD SRLPPRHLHV VHTGKTSWI KWBSPYDSPD QDLLYAIAVK DLIRKTDRSY 
KVKSRNSTVE YTLNKLEPGG KYHIIVQLGN MSKDSSIKIT TVSLSAPDAL KIITBNDHVL 
LFWKSLALKB KHFNESRGYB IHMFDSAMHI TAYLGMTIDN FFKISNLKMG HMYTFTVQAR 
CLFGNQICGB PAILLYDBLG SGADASATQA AHSTDVAAW VPILFLILLS LGVGFAILYT 
KHRRLQSSFT AFANSHYSSR LGSAIFSSGD DLGBDDBDAP MITGFSDDVP VNlh 

Seq ID NO: 79 DMA sequence 

Nucleic Acid Accession «: Eos sequence 

Coding sequences 120-755 

1 11 21 31 41 51 

111111 
CCTGCCACTC CCCTGCCAGT GTGCGAGGCA GOSTGAAGCT GGGGCCTGCT CCCCGCAGCC 
TCTGGAGCGC A-l-CTCAGACC TTCTGAGACC TATGTTGCTG GCC(XCCAGA ACCCGCAACA 
TGGCAGATGG GGCAAAGGCC AACCCCAAAG GGTTCAAAAA GAAGGTGCTG QATAGATGCT 
TCTCTGGGTG GAGGGGCCCA CGCTTCGGGG CTTCCTGTCC TTCRAGAACC TCCRGGTCTA 
GCCTGGGTAT GAPGARGTTC TTCACCQTGG CCATCCTTGC TGGCAGCGTT CTGTCCACAG 
CTCACGGCAG CCTGCTCAAC CTQAAGQCCA TGGTGGAGGC CGTCACAQGG AGGAGCGCCA 
TCCTGTCCTT CQTGGGCTAC GGTTGCTACT QTGGGCTGGG GGGCCGTGGC CAGCCCftAQG 
ATQAGGTQGA CTGGTGCTGC CACGCCCACG ACTGCTGCTA CCAGGAACIC TTTGACCAAG 
GCTGTCACCC CTATGTGQAC CACTATGATC ACACCATCGA GAACAACACT GAGATAGTCT 
GCAGTGACCT CAACAAGACA GAGTGTQACA AGCAGACATG CATGTGTGAC AAGAACATGG 
TTCTGTGCCT CATGAACCA3 ACGTACCGAG AGGAGTACCG TGGCTTCGTC AATGTCTACT 
GCCaGGGCCC CACGCCCAAC TGCAGCATCT ATGAACCGCC CCCTGAGGAG GTCACCTGCA 
OTCACCAATC CCCAGCGCCC CCCGCCCCTC CCTAGAGCCT CTGAGGTTTG AGAGAGAGAG 
Q3GGAQGAGG GTCTQQCTTG GGGACCAGAC GAGGTGCAGG GAGGGTAGGA GCCAGGCCAG 



GAGCCTGAGG GTTGCTGGTT GCCTCCTCCC TGGAGCTCTC CAGTGAGGGC 

,G GAAGGCCTGG GTCCTGACTC CCCCAGCCCA GCCCCAGGCA TGGGTGCCTC 



CTGCTGCTGG TTCTGGACTG GGTGGGAGGC ACGGAGCTTA TAGGGGTCTC 

GGCCGGGOAQ ACCT3AGAGA GAGGAQQAGG GGCCTCTGAG TGGGGCCTCT □TTGCTGGv.G 1080 

CCAGTrTAAC TCCCCGGAGC CTTAGAftAGT CTGAGCCTTA GAAAGTCTGG GCCTGAGCAT 1140 

55 CCAGGCCCAG AQCTQGATGC ATCCTCGGCC CAAGATCACA GGAAGGCAGA "GCIGGTCA 1200 

CAAAGCATGG GTTCCGGGAG CCCCTCAGCT GATCCCACAG GATGGCCTGG GGTGGTGGCT 1260 

ACTTTX3GGCT TGAflGCTCTC TAGAGCCCCA TTCACAGAGC AGGTGCCCCA CCCCAGCCCT 1320 

^?G^TG GCCATGAGTC CAGGTTCAGA CCCTAGGGGA IGTGCAGTGC TGAGTGTCCC 1380 

AGCCCTGAAG TCTAGCCTCA GAGCAGGCTC CTGGGCACTG ACAGAGCCAC ACACCCAGGT 1440 

60 ACACCCCCCa GGCTAATATG GGGACaCACA AAAATATGGC CTTCAACACC TACAAGTCTT 1500 

CTCTCTGTCC CCCATCGRCC CATGCACATC ACCCCAACCC TGCTCTGCCA CACCACGTGG 1660 

GTCTGAGGAG CACICCTACC CCCCGGCTCC GGGTCCCTGA CAGACACTGC CCTCCTGGCC 1620 

TGCACCTGGA ATGGCAGCCA GAACaCTQQA GGGGGACCCA CACCCTCTCT CCCTGGGCTG 1680 

TGGATCGGK3 TTGCAAGTCC CATCCCTCCT GGCCTGCACC TQQRACAGCA GCCAGAACAC 1740 

65 a3GAGGGGGA CCCACACCCT CTCTCCCTGG GCTGTGGGCA GGCGTTCACAA GTCCCATTGG 1800 

ioGG^^AMG GCTGAGGGCT QAGACCCACC TCAGTGAAGA GGGAAACGTA AGCAGGGAGA 1860 

GCATCAAGGC GGGAGGCAGC CTGGAAACTT CTAGAAGAGQ QTCAAGGTAG GAGGCTACAT 1920 

QQAAGCTTCT AGAAGGCAGT AAGGAACAGG GTGGTGAAGG AGAGGGGAGA AQATGGGAGC 1980 

A?^AGAGAC G^TGAGTG ACAGCCACCA GCAGGGACGG GCCICCAGGC AACACAGAGG 2040 

70 aSgGGGGC a^GCCTTA CG«3GCTGGG TTCCTGTGTG GGGCGTGGAG CTGGGGCTGA 2100 

ArTcnAaaaa ACGGCCCTGC GGCCCCC»GC ACTGTTGTTT CTCCAGGTGC AOCTTACCTT 21S0 

cScS™?c tSc^GCC TCAACrCAGG GTCAGCAGCC TCCACGGAGC CAGCCCCACC 2220 

T^cSaAAA CCACTGAAGC CACCCTCCQC CCQACGTCAA CGGGGATCCA CTAGTTTAGA 2280 
GCMCQQCCCC CGCGTGC 
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MflDGAKANPK C3FKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFFTV AILAGSVLST SO 

AHGSLLHIiKA MVEAVTGRSR ILSFVGYGCY CGLGGRGQPK DEVDWCCHflH DCCYQELFDQ 120 

5 GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTCMCDKNM VLCLMNQTYR EEYRGFLMVY 180 

CQGPTPNCSI YEPPPEEVTC SHQSPAPPAP P 



I I I I I 

GAAATCTGTA CGAACTATTC AGQCTGCTTT TAGAGGCATG AAAGTTAGAC ™™„».„.„. 

1 5 AAATGTATCA GAGGAAAAGA TGGCAGCCAT .TGTTAACCAA TCTGCACTCT GCTGTTACAG 
AAGCAAAACT CAGTATGAAG CTGTTCAAAG TGAAGGTGTT ATGATTCAAG AGTQGTATAA 
AGCTTCTGGC CTTGCTTGTT CACAGGAAGC AGAGTATCAT TCTCAAAGTA GGGCTGCAGT 
AACAATTCAA AAAGCTTTTT GTAGAATGGT CACAAGAAAA CTGGAAACAC AGAAATGTGC 
TGCCCTACGG ATTCAGTTCT TCCTTCAGAT GGCTGTGTAT CGGAGAAGAT TTGTTCAGCA 

20 QAAAA3AGCT GCTATCACTT TACAGC^TTA TTTTAGGACG TGGCAAACCA GAAAACAGTT 
TrrACTATAT AGRAAAGCAG CAGTGOTTTT ACAAAATCAC TACAGAGCAT TTCTGTCTGC 
AAAACATCAA AGACAAGTCT ATTTACAGAT CAGAAGCAGT GTTATCATTA TTCAAGCTAG 
AAGTAAAGGA TTTATACAGA AACGGAAGTT TCAGGAAATT AAAAATAGCA CCATAAAAAT 
TCAGGCTATG TGGAGQAGAT ATAQAQCCAA GAAATATTTA TGTAAAGTGA AAGCTGCCTG 

25 CAAGATTCAA GCCTGGTATA GATGTTGGAG AGCaCACAAA QAATATCTAG CTGTATTAAA 
AQCTQTTAAA ATTATTCAAG GTTGCITCTA TACCAAACTA GAGAGAACAC GGTTTTTGAA 
TGTGAGAGCA TCAQCAATTA TC^TTCAGAG AAAATGGAGA GCTATACTTC CTGCAAAGAT 
AGCTCATGAA CACTTCTTAA TGATAAAAAS ACATCQAGCT QCTTGTTTGA TCCAAGCACA 
TTATAGAGGA TATAAAGGAA GGCAGGTCTC TCTTCGGCAG AAATCTGCTG CTTTGATCAT 

30 ACAAAAATAT ATACQAGCCA GGGAGGCTGG AAAGCATGAA AGGATAAAAT ATATTGAATT 
TAAAAAATCT ACAGTTATCC TACAAGCS^CT GQTQCQTGQT TGGCTAGTAC GAAAAAGATT 
TTTAGAACAG AGAGCCAAAA TTCGACTTCI TCACTTCACT GCAGCTGCAT ATTATCACCT 
GAATGCTGTT AGAATTCAAA GAGCCTATAA ACTTTACCTG GCTGTGAAGA ATGCTAACAA 
GCAGGTTAAT TCAGTCATCT GTATTCAGAG ATQGTTTOSA GCAAGATTAC AAGAAAAGAG 

35 ATTTATTCAG AAATATCATA GCATCAAAAA GATTGAGCAT GAAGQTCAAG AATGTCTGAG 
CCAQCGAAAT AGGGCTGCAT CSiGTAATACA GAAAGCAGTG CGCCATTTTC TCCTCCGTAA 
AAAGCAGGAA AAATTCACTA GTGQAATCAT TAAAATTCAG GCATTATGGA GAGGCTATTC 
TTGGAGGAAG AAAAATOATT GTACAAAAAT TAAAGCTATA CGACTAAGTC TTCAAGTTGT 
TAATAQGGAG ATTCGAGAAG AAAACAAACT CTACAAAAGA ACTGCACTTG CACTTCATTA 

40 CCTTTTQACA TATAAGCACC TTTCTGCCAT TCTTGAGGCC TTAAAACACC TAGAGGTAGT 
TACTAGATTG TCTCCACTTT GTTGTGAGAA CATGQCCCAG AGTGGAGCAA TTTCTAAAAT 
ATTTGTTTTG ATCCGAAGTT GTAATCGCAG TATTCCTTGT ATGGAAGTCS^ TCAGATATGC 
TGTGCAAGTC TTGCTTAATG TATCTAAGTA TGAGAAAACT ACTTCAGCAG TTTATGATGT 
AGAAAATTGT ATAGATATAC TATTGGAGCT TTTQCAQATA TACCGAGAAA AGCCIGGTAA 
45 TAAAGTTGCA GACAAAGGCG GAAGCATTTT TACRARAACT TGTTQTTTQT TGGCTATTTT 

^Sc^?CTC ta^aI™ cao~ :ca=tc IS™ gSStI 

CAAQCAAAAG AAGAATTCTT CIATAAGCAT TCCTTTTATC CCAGAAACAC CTGTAAGGAC 
CAGAATAGTT TCAAGACTTA AGCCA3ATTQ GGTITTGAGA AGAGATAACA TQQAAGAAftT 
50 CACAAATCCC CTGCAAGCTA TTCAAATQGT GATGGATACG CTTGGCATTC CTTATTAGTA 
AATGTAAACA TTTTCAGTAT GTATAGTGTA AAGAAATATT AAAGCCAATC ATGAGTACGT 
aa&GTGATTT TTQCTCTCTG TGTACAACTT TTAAAATCTG ACTTTGTTTT AAAAAAACAT 
AAAETGTTCA TTACATTCTT CATTTTTATC ATTTATAGTT TTATGCATSI AATAAACTAA 
TATGTCATAA GATG 



1 1 1 I 1 

MKVRQKLKNV SEEKMAAIVN QSALCCYRSK TQYEAVQSEG VMIQEWYP 
HSQSRAAVTI QKAFCEMVTH KLETQKCAAL RIQFFLQM.W VRRRFVQQKE --Q"^^^ 
TWQTRKQFLL YRKAAWLQN HYRAFLSAKH QRQVYLQIRS SVII_Q.^SK 3FIQ.<RKFQE 
IKNSTIKIQA MWRRYRAKKY LCKVKAACKI QAWYRCHR.^H KEYLAVLKAV KIIQGCF TK 
LERTRFLNVR ASAIIIQRKW RAILPAKIAH EHFLMIKRHR AACLIQAKYR ^f^KGRQVS^R 
QKSAALIIQK YIRAREAQKH ERIKYIEFKK STVILQALVR GWLVR.<RFLE QRAKIRLLrfF 
TAAAYYHLHA VRIQRAYKLY LAVKNANKQV NSVICIQRWF RARLQSKRFI QK/.-ISIKKIE 
HEQQECLSQR MRAASVIQKA VRHFLLRKKQ EKFTSGIIKI QALWRGYSWR KKNDCTKIKA 
IRLSLQWNR EIRBENKLVK RTALALHYLL TYKHLSAILE ALKHLEWTR 13^1;==™ 
OSGAISKIFV LIRSCNRSIP CMEVIRYAVQ VLLNVSKYEK TTSAVYDVEH ClDILLEIiQ 
?Se^S ADKGGSIFTK TCCLIAILLK TTNRASDVRS RSKWDRIYS ^YKLTAHiO^ 
MNTERILYKQ KKNSSISIPF IPETPVRTRI VSRLKPDMVL RRDNMEEITN PLQAIQMVMD 



Coding sequence: 
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1 11 21 31 41 51 

I I I I I I 

TTCTCCCGCA ACCTTCCCTT CGCTCCCTCC OSTCCCCCCC AGCTCCTAGC CTCCGACTCC 
CTCCCCCCCT CACGCCOSCC CTCTCGCCTT C33CCQAACXA AAQTQQATTA ATTACACGCT 
TTCTGTTTCT CTCCGTGCTG TTCTCTCCCG CTGTGOSCCT GCCCX3CCTCT OSCTGTCCTC 
TCTCCCCCTC GCCCTCTCTT CGGCCCCCCC CTTTCACGTT CACTCTGTCT CTCCCACTAT 
CTCTGCCCCC CTCTATCCTT GATACAACAG CIGACCTCAT TTCCCGATAC CTTTTCCCCC 
CCGAAAAGTA CAACATCTGG CCOSCCCCAG CCOSABGACA GCCCGTCCTC CCTGGACAAT 
CAGACGAATT CTCCCCCCCC CCCCAAAAAA AAAAGCCATC CCCCCGCTCT GCCCCGTCGC 
ACATTCGGCC CCCGCGACTC GGCCAGAGCG GCGCTGGCAG AGGAGTGTCC GGCAGGAGGG 
CCAACGCCCG CTGTTCGGTT TGCEACACGC AGCAGGGAGG TGGGCGGCAG CGTCGCCGGC 
TTCCAGACAC CAATGGGAAT CCCAATGGGG AAGTCGATGC TGGTGCTTCT CaCCTTCTTG 
GCCTTCGCCT CGTGCTQCAT TGCTGCTTAC CGCCCCAGTG AGACCCTGTG CGGCGGGGAG 
CTGGTGGACA CCCTCCAGTT CGTCTGTGGG GACCGCGGCT TCTACTTCAQ CAGGCCCGCA 
AGCCGTGTGA GCCGTaSCAG CCGTGGCATC GTTGAGGAGT GCTCTTTCCG CAGCTCTGAC 
CTGGCCCTCC TGGAGACGTA CTQTGCTACC CCCGCCAAGT CCGAGAGGGA OGTGTCGACC 
CCTCCGACCG TGCTTCCGGA CAACTTCCCC AGATACCCCG TGGGCAAGTT CTTCCAATAT 
GACACCTGGA AGCAGTCCAC CCAGCGCCTG CGCAGGGGCC TGCCTGCCCT CCTGCGTGCC 
CGCCGGGGTC ACGTGCTCGC CAAGGAQCTC GAGGCGTTCA GGGAGGCCAA ACGTCACCGT 
CCCCTGATTG CTCTACCCAC CCAAGACCCC GCCCACGGGG GCGCCCCCCC AGAGATGGCC 
AGCAATCGGA AGTGAGCAAA ACTGCCGCAA GTCTGCAGCC CGGCGCCACC ATCCTGCAGC 
CTCCTCCTGA CCACGGACGT TTCCATCAGG TTCCATCCOG AAAATCTCTC GGTTCCACGT 
CCCCCTGGGG CTTCTCCTGA CCCAGTCCCC GTGCCCCGCC TCCCCGAAAC AGGCTACTCT 
CCTCGGCCCC CTCCATCGGG CTGAGGAAGC ACAGCAGCRT CTTCAAACAT GTACAAAATC 
GATTGGCTTT AAACACCCTT CACATACCCT CCCCCC 



MGIPMGKSML VLLTFLAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGF YFSRPASRVS 60 

RRSRGIVEEC CFRSCDLALL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 120 

QSTQRLRRGL PALLEARRGH VLAKBLEAFR EAKRHRPLIA LPTQDPAHGG APPEMASNRK 180 

Seq ID NO: 85 DMA sequence 

Nuclaio Acid Accession «: 3!M_092643.1 

Coding sequences 1-S352 

40 

1 11 21 31 41 51 

I I I I I I 

ATGACTGAAG CCATTACAGA AGCAGCAGTA GCCTCAAGTG AGGAGGTGTC AGAGGAAAGA 60 

GACGACCTGG GGCCTCTTGA ATTACATGAC AGTGGrTACCT TTCAACAAGT CGTGAACCTT 120 

45 CTGSACATCA TTGACAGCGA GTCAGCAAAG ACGGACACAA CAGGGGCAGG CCTTGACATG 180 

CGGAAGACCC TGGCCTCGGT GATAATCATG GAGAAGGCCS CCACTGAGCC TTCTGTAGTG 240 

ATAAACACTC TCATCCGCTG CCTGCAGGTG CC3M3AGATTT CCACCCAGCG CAAGGTCAAC 300 

ATTTACAACA TCCTCCAGGA CATCATCCAG CAGGAGGGGG AGCTGGAGGA GCAGTGCGTG 360 

CAGAGGCTGG TGGCCATTGC CTCCAAGGAG ATGAGGGAGR TCCCAGAGAT GGAGGGCTAT 420 

50 ATGAAGGCAG AGGTGGCCAG CGACACACTG GTGGCTCTGI CCOGAAACCA CTTCAGCTTG 480 

GTCATGTACG AGCTGCAGCA CCACCTCAAG CCCCTOiACC TCACTGATGA ATTTGTCATC S40 

ATCACACTGG CCAAGCTGOC CAACGGCAAT GTGTTTGAGT TCATGCCATA CATGGGCATC 600 

ACCCTGGCTA CCATATTCAC CATGCTGAGA CTTGCCAATG AAGCCAAGAT ACGCCAGGCG 660 

ATCTGCAGTG CCATGGAGAC CTTCTGTGAG ACGGTGCAGT TTTATCTGAA GaCCTGGAG 720 

55 GAGAGCGTGT ACCCCGTGAT GACTGAGGAG GAGTTTGCXC TGAAGGTGTT CCCCATGTAT 780 

CGCTACTTCG TGACAGTGTG GCTGAGGCAC TACAACCCOG AGGTGAAGCT GGGGGTGATC 840 

AAGTCCCTGA AGCCCATGCT CGGCCTCCTT CTGCCCAACG ATGACCTGOS GGAG««3TC 900 

TACGACTACA TCCCCCTGCT GCTGGCGGAG TACCAGGGCA GTCTGGAGGT CTTGAGGCAG 960 

ATCCTGGAAC TGTCAGTCAC CACCAACACC CCTGTCCCCC AAATGCAGCT ACACACCATT 1020 

60 TTCACAGAAC TGCACGTCCA GGTGTGCAAC AAGGCCCCGG CCCAGCATCA S™CAGCAGC 1080 

CAGAATCTGA TGGAGATGGT GCACTGCTTC GTAGCCCTTG CTCGCTCCTA CC>.CAAG^A^ 1±40 

CTGATGAAGT TCTTCTTCAG CCAGATGGAG ACAAACAAGG AGGCCGTCCQ CGTGGGGACT 1200 

CTGAATCTGA TTAGGGCTAT AGTGAGCGCA GATGAGCCCA GGATGAGTAT CAGGGCCATC 1260 

TACCTGGCTA TCCGGGTAGT CAAGAACACC ATCTCTGATA CCCGGTCC.AA GGIGAGGAT^ 13^0 

65 GCTATTCTCC ACATCATTGG GCAGTTGGCT CTCTGTGGCT ACCAGGAGAG AATCAAAGGC 1380 

TGGGGCCTGA AGTACCTGTC TGTGCAGCTG ACCTTATCCA CCTACAAACT GACAAATCGC 1440 

OSGGAGAAGT TTTATCAGAG GGACTTGGAG GAGAGGATGG TCCACAAAGT CACCATGGAC 1500 

ACTGTGAAGA TCATTACCTC TTCTGTCAGT GGGATGACCA CCGAGTTTTG GGTGAGGCTG 1560 

CTGTGCTACA TCATGGAGAC AGACTACGTG GAAGCITTGA CTCCTATCIG TATCAGCCTC 1S20 

70 irAAACCTGG CAGAACACCA GCTCCATGGC CAGGATGTGG ATGTCAGCGT GGCTGGCAAG ISBO 

AGCAGGCAAG TGGACCTGCC TGCACCTCAG AAGCTGCTGG CCCGTCTCCT GGTGCTGATG 174 0 

TCATCACCTT ACAAGGGGGA GGGTCGTGGG ATAGCCATGC TCAACCTCTT GAGGACCCTG IBOO 

AGCCAGAGCA TCGCACCCTC CATGGCCGAC ATGTGGGAGC TGGAGATTGC GCTACTGGTC 186 0 

OGGTACCTGG AAGAACATAC TGAGTTCACT TGGGATCAGA AAGCCTGGGA AGACAAGCTG 1920 

75 ATTCAGTTTC TGOGAAACTC CCTCAAGAAG ACCOGGGGGT CTAGCTGGAG CCTGCGCTTG 1980 

AGTAAAGAGC TGAACAACCA GATTGCGAGC TTTGACAGCC CCTCTCTGGA GAAGGGCTTT 2040 

SgTACCGGG CCTTGGGCTT CACCTTGGCC ACAQGCCTGG AGGCCAGCAA GGTGSAGGTC 2100 

CTGCTGTTGG AGCTGCTGTA CSiAGACGGAC TACAGCAATG ACTTTGACAG CGAGGGTGTG 2160 
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IT GCCACCCCCA GTTGCTCCTC AACCTCGTGG ACAGCCCCAT CACCGCTARG 2400 
:C ATTRTGTCAG CAGCTGCCAG GACATCTGTC TCAAAATGGC CTTCATGAAG 2460 
AUTiiTHilGC AGGTTACCAA GGCCATCAAC AACATCAAGG ACCTGGAGGA CTTTCACTTT 2520 
GCCCAGAAGA CGACTCTTAC CAGCATTATA GTGGCGGTCA TCAAGGCAGA ACCGACTGAC 2580 
•T CTCCAGTGCG AGCCTTGGCG ATG3AGGCCC TCTCGCACCT GAGCRAGCTG 2640 
"lA GGAAAACAGT GAGCTGATGG ATATCAGCAT ACATTCTGTA 270 0 

^G AGAGGACAAT GAGTCCATTA AGACCCTGTA TGCRAATGCC 27 6 0 

CTGAGCTCCC TGGAGCAGCT GATGGAGAGC CTCCTGCAGA GGCAGCTGGA CCCCAAGGGG 2820 
CTGCAGGAGA TGGTGCAGCT CCTGGAAAAG TGGATCTTGT CGGAGAAAGA ATGGGAGCGG 2 8 80 
GAAAAGGCCG TGAGCCTCCA TCTCTATCTC ATGTGGATTT ATGTCCACAG CACTGCTGTC 294 0 
TGTATCCACC TAAAGCTGGG GCAGTTTGGC ACAATGGTCG GACTCATTGC CCCGTGCACC 3000 
15 TGTGATGCCC ATCflAAGAAC CCGCATGGCC TCAATGAATG TCCTGTCCAG CCTGCTAGAT 3060 

CTTCACGCAA GCCAGACCTG CTCCTTGTGG GGCCCTTCCA AGCAGAAGGA GCTTGAGA?JS. 3120 
TGTARGGGGG ACCTCCAGAG CACAGATGTG GAGAAGATCT TCTGTGCATC CTGCAGAATC 3180 
GCCAAGGTGG TCTGCATGGA GTTTAGCTGC GATGAGGTGG TCTCGCTCAT CCAGAAGCTC 3240 
TGCGAGAACA CIGGGGCCAT GAACCTGCAG CATGACRAGG CCTCrTGTCAC CTGGATAGCC 33 00 
20 TTCTTCCTCC AGATGCGGGC CAAGGAGCTG GAGGACAAGG TGGCCGAGAT CCTGAGTGCC 33 60 
ATCCTGGTGC ACCTGCOSGT GGTGGACCAC CCAGAGGTGC GGCGCCTTCT CATTGACGGC 3 4 20 
ATCCTGCTGC TGGOSCACCA CCACCAGGAG ACCRTCCTCA CATCGCTCCT GAGGCAGCCA 3 4 80 
CTGCCCATGG AGAGCCRCCT GGCAGAGGTG TGGCTGGCAG TGTCGGAGAA CGTGCCCTTC 3540 
GCCCGGACCA TGCTCCaCAG CCTGATGGGC CGGCTGCAGT CAOSGCTCA.G CGCCAGAATC 3 600 
25 AGTGCCACCT OaiAGGCTGA CATCTGGCGC CTGGCTGCGG TGGACCCCCT GATGACCCTG 3 6 60 
TGCACCATCC ftCCTTCTCAT TCAGAAGCTG GATGAGAATG ACAAGCTCCC GGACTTCCTC 3 7 20 
CCTGACCTCA TCTACACCCT CCTGCTGCAG CTTGGRAGCA GCCACCGRCC AGAGGCCGCC 37B0 
CCGCCGGTCT TGAAGATGTG GAAGCTGGTC CACACCACTC CTCTGCCGGA GGAGATGAAC 3 840 
CTGCAAAGGT GCTCTCGAGG GCGGGTCACT ATCAAGTCCA TGCAGCTCTT GTTCAAGAGA 3 900 
30 GTCAAGAGCC AGCACCTGGC ACATACCCTG GACGAGCAGG CAGTGTGGGA CCTCCTGCAG 3 960 
GACGGCGGGA CATTCCTGGA GGGTGTGAGC CTGCTGGCCa GGCTGTGCAT GCAGCACGTG 4020 
GAGGGCCACa GGCAGAGGCT GGCCGAGCTG GTGCTCAGGG GCATGGACTC AGAAGTCCTG 4080 
AGCTGCCGCA TCAGCSGCAC AGCKGTCTGC GTGGRAATGA GGCACCGGTT CATGAGCGGC 4140 
CCAGTTCTGT ACCAGGAGAR GCTGCTGAftG CCGGCAGCTT TGCTGCTGGA GAAGGGTGCC 4200 
35 GACCAGGAGG AAGACGAGGC CCTGCGGGTG CTGTCCCTGC GCGCCCTCGG CAACATGGCC 4260 

CTGGGCGCCC CCAAGAAGGT GAAGCAGTAC CGGAAGGTCT TGCTGGAGAA GTGCCTGGGC 4320 
CCCCTGAGGG AGCCCGTGAG CAACAGCGTG ACTGCCGAGG GCATGGAGGC CCTGACCAAG 4380 
ATCCTGGCTG AGCTCCGGGA AGGGGATGTG GGGTCCTCTT TCGACX3CCAT GTCTGAGCAG 4440 
TGCAGGATCT TCTTCGACAA CGAGAGCGAG CTGCTGCGTC TGAAAGCCTT CATOCTCTTT 4S00 
40 GGAAAGCTGG CRAGGGTGGT CGGGATGTCC AAGAAGCATT TCTTCSAAGG GGAGGTGARG 4S60 
AAGGCCTGGA TCCCCCTCAT GCTGCACTCC CAGGACCCCT GCTCCAATGC AGCCCAAGTA 462 0 
^A?AcLcC TGGGCTTTCT CTCCCAGTCT GGGGCCCGCT GTTCCCCGAG GAAACAGGTC 4680 
CTTGGGTCTG CTGGCTCGGC CTGTATGGCT ACCATGTTTC AGTGTGTGCA CTTCTGGGGC 4740 
TGGAAGTCCC TGGAGCATCC CTCAGGGCCA AGTGATACCG CTACTGATGA O^TGACC 4800 
4< CTTTTCCAGA CAACCATGTG CTCCATCCTG ACTCGGAAAA RGCCGGCTGT TCTCTACOSC 4860 

TTCTTGCTAG AAACAATGGC CTATGTTAAA AATAACTTGT CAAGAATCAG AATCGCTGCT 4920 
TGCAACTTGG CAGGAATTAT TATGAAGCAG ATGTCTACAC ATTATCTGAA AAAGCTGGAC 4980 
TTCCCAGCAT TACGGAATTC CCTCCAGGAA CTACAGCTGG ACCCGGATCC CGGGCCCACT 5040 
GGCCCAAAGG GTCGGAAACT RGCTTTGTGT CTTCACTCGG AGTTCATCCA CACGCATGCT SlOO 
rl«™c SSSg GCACTGTGCC AAGCGAGCAG GTAAACCAGG ACTTCCAAAC 5160 



50 GACAGCGTGC CTAAGGCCAG GCACTGTGCC AAGCGAGCAG UTftAAi:iJ«jvi - 

TGTTCCCRGT CCAACTGCCA AAGGAATTCA CCATCCGGGG TCCCATCAAC ASGAGAGAGC 5220 

TGCTATGCTG TCCAAAATCC AACCCGTGCT AAAGCCTTCT CTGGAACAGT <^^^ "80 

CTCTTTTGTG GRGCAAATGG TGACTTGATT TCCTGCCCAG TGATAAAGAG TATTGCTGCT 5340 
AAACAAGCTT GA 



Seq ID NO: 



iItEAITEAAV isSEEVSEER DDLGPLELHD SGTFQQWNL LDIIDSESAK TO^^G^™ 
RKTLASVIIM EKATTEPSW INTLIRCLQV PEISTQRKVM lYNILQDIIQ QEGELEEQCV 
QHLVAIASKE MREIPEMEGY MKAEVASDTL VALSRNHFSL VMYELQHHLK PLNLTDEFVI 
?T^^GN VFEFMPYMGI TLATIFTMLR LANEAKIRQA ICSAMETFCE TVQFYLKHLE 
ESVYPVMTEE EFALKVFPMY RYFVTVWLRH YNPEVKLGVI KSLKPMLGLL I-PNODLREQV 
YDYIPLLIAE YQGSLEVUtQ ILELSVTTOT PVPQMQLHTI FTELHVQVCN KAPAOHQYSS 
QNLMEMVHCF VALARSYPKE LMKFFFSQME TNKEAVRVGT LNLIRAIVSA DEPRMSIRAI 
?LAIRWKNT ISDTRSKVRM AILHIIGQLA LCGYQERIKG WGLKYLSVQL TLSTYKLTNR 
REKFYQRDLE ERMVHKVTMD TVKIITSSVS GMTTEFWVRL LCYIMETDYV f^LTPICISL 
TNLAEHQLHG QDVDVSVAGK SRQVDLPAPQ KLLARLLVLM SSPYKGEGRG lAMLNLLRTL 
SQSIAPSMAD MWELEIALLV RYLEEHTEFT WDQKAWEDKL IQFLENSLKK TRGSSWSLRL 
SKELNNQIAS FDSPSLEKGF LYBALGFTLA TGLEASKVEV LLLELLYKTD YSNDFDSEGV 



SKELNNQIAS FDSPSLEKGF LYBALGFTLA TULEABRVEV i.i.i.ciiui:^iu 
IMCFGLCARG QVKTVLWLH DFEERIQESE QSWQISAWRK DHPWRRETVK SALMVMYSCV 
ASYCHPQLLL NLVDSPITAK IIHHYVSSCQ DICLKMAFMK SWQVTKAIN HIKDLEDFHF 
AOK??L?SII VAVIKAEPTD NLVSPVRALA MEALSHLSKL KPFYSTEENS EIWDISIHSV 



AQKTTLTSII VAVIKAEPTD NLVSPVRALA MlSAbSHl.aiu. rtrriBiiia.N^ ""'"^"Zu aVn 

ISLQLPGEDN ESIKTLYAHA LSSLEQLMES LLQRQLDPKG LQEMVQLLEK "I^SEKBWER 960 

EKAVSLHLYL MWIYVHSTAV CIHLKLGQFG TMVGLIAPCT CDAHQRTRMA SMNVLSSLLD 1020 

LHASQTCSLW GPSKQKELEK CKGDLQSIDV EKIFCASSRI AKWCMEFSC DEWSLIQKL 1080 
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CENTGAMNLQ HDKASVTWIA FFLQMRAKEL EDKVAEILSA ILVHLPWDH E 
ILLLAHHHQE TILTSLLRQP LPMESHIAEV WLAVSENVPF ARTMLHSLMG RLQSRLSPRI 
SATSKADIWR LAAVDPLMTL CTIHLLIQKL DENDKLPDFL PDLIYTLLLQ LGSSHRPEAA 
PPVLKMWKLV HTTPLPEEMN LQRCSRGRVT IKSMQLLFKR VKSQHLAHTL DEQAVWDLLQ 
DGGTFLEGVS LLARLCMQHV EGHRQRLAEL VLRGMDSEVL SCRISSTAVC VEMRHRFMSG 
PVLYQEKLLK PAALLLEKGA DQEEDEALRV LSLKALGHMA LGAPKKVKQY RKVLLEKOiG 
PbREPVSNSV TAEGMEALTK ILAELREGDV GSSFDAMSBQ CRIFFDNESE LLRLKAFILF 
GKLARVVGMS KKHFFKGEVK KAWIPLMLHS QDPCSNAAQV RYILGFVSQS GARCSPRKQV 
LGSAGSACMA TMFQCVHFWG WKSLEHPSGP SDTATDDKMT VFQTTMCSIL TRKKPAVLYR 
FLLETMAYVK NNLSRIRIAA CNLAGIIMKQ MSTHYLKKLD FPALRNSLQB LQLDPDPGPT 
GPKGRKLALC LHSEFIHTHA DSVPKAEHCA KRAGKPGLPN CBQSNCQENS PSGVPSTGES 
CYAVQNPTRA KAFSGTWFK LFCGANGDLI SCPVIKSIAA KOA 



I I I I I I 

CACTTCTCCC TCTCCTTAGT TCTCCGGTAA CCAAACCTTC TCTCGTTATA TACCCAATGA 60 

TAAAACCAAA GTAATGTTGT ACAGGTTAAA GATTCAAAAC AAACCATAGC AACTGAAACA 120 

GGGTATCAGG ACGCAAATGC TTGGCTGGAA TGGATTAAAC ATTCTGTTGC AGGCTAAACA 180 

AAAGCGCCTG TTACACTTGT GTGACAGGCA GGCCAGAGAC CCAAATTGTT CCCTTTCCAC 240 

TTGGGTGGTC CTCTCACAGA CCGGGCATGA GCTGTGTGGT AGCTCTCTTC CAGAACCCCA 300 

CAGCCTGGGG TGGTGAGTCA TGCAAGACTC TTTCACTGCT GTTCCCTGAG GTCAAGAGCC 3 60 

CTGCAGGTCA GCCCTGAGGA CTATCCGGCC TCCAGCCCCT GATGTTAACT TCACCTCATG 420 

CCTTTCACGG CAGGGGGAAA AGTTAGCATT CCTTGGAGAC T7AACACGGT GCAGTGAAAC 4B0 

CAGGCCTTTT CAAGAGCTTA CCAATCAGTC TGCCCTTGTT CATCCCCQAG CAGATGTGTG 540 

GTGGTATTGC TGGGGACTAC TGCTGGGTAC TCTGCCAAGT AATTAGAGCA GCACTCATGC 600 

TCTAGTCCAA CTGGCCATCC CTTTCACCCT AGCATTCTGT CAATATGATA ?AAAAAAATG 660 

TAAAGAAGTG CCTCACGTGG GTCCTTGACC CCCATGTTTA TATAAACACT ATTGGGTTCT '720 
ATGAGGAGAA CCCGATGA 

Seq ID NO: as DNA sequence 

Nucleic Acid Accession #s FGENESH predicted 

Coding sequence: 1..7S9 

1 11 21 31 41 51 

ATGCAGGAAG AAACAGGCTG GCAGCTTGGG GAAAGGATTC TTACATACCA CGGCGACCAG 60 

GTAACTCTGT GCACAGACCC AGAAGAAAGT AGAAAAATAA GTTTTGGAGG AAACCTCATT 120 

GTCAACACAC CTCACCAGGT CAGARCTATC CTAAGTCAGA AAGCAGAAAG GTGGCTTACT 180 

GACTCCAGAA TCTTAAAGTA TGAGGCTATT CTCTTAGAAA AAGATGATTT AACATTAACC 240 

ACTGATAATT CACTTAACCC AGCAGGTTTC CTARCAGGGG ATCTATATCA AAAGAGAGAG 300 

CACACATGTT TAGTTTTAAT TAATTACCAT ACAAAGGTCC GACCAGACCT AGGAGAAACT 360 

CCCTTCAGGA CGGGACAACA CTTTATAGAT GGTTCCTCCC AGGTGATAGA GGGAAAAAGA 420 

TACAATGGGT ATTCAGTAAT TGATAGAGAA ACTCTTGTAG AAATAGAGTT AGGAAAATTG 480 

CCTAATAATT GGTCTGCTCA AACATGTGAG CTGTTTGCAC TCAGCCAAGC CTTAAAGGAC 540 

TTACAGAACC AGGAAGGAAC CATCTATACA GATTCTAAGT ACGCCTTTGG AGTGGCTCAT 600 

ACATTTGGAA AAATTTGGAC TGAACGAGGT CTTATTAATA GCAAATGCCA AGACTTGGTC 660 

CATAAGGAAT TAATCACCCA AGTATTAACT AACCTTCAGC TGCCAAAAGA AACAGCTATT 720 
GACCATGTCC CCGGACACCA AAAAAGCCTT TCTTTTTAA 

Seq ID NO: 89 Protein Sequence 
Protein Accession #: FGENESH predicted 



Leetgwqlg eriltyhgdq vtlctdpees rkisfgghli vntphqvrti lsqkaerwlt 

DSRILKYEAI LLBKDDLTLT TDNSLNPAGF LTGDLYQKRE HTCLVLINYH TKVRPDLGET 
PFRTGQHFID GSSQVIEGKR YNGYSVIDRE TLVEIELGKL PNNWSAQTCE LFALSQMjKD 
LQNQEGTIYT DSKYAFGVAH TFGKIWTERG LINSKCQDLV HKELITQVLT NLQLPKETAI 



I I I 1 I ' 

AGAAACTCAG IgaCCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGGAGA G^AAGAAAAA 
GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC CGGAGACGTC 
TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 
CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT 
GCCCGCCATG CAGGGCGCTC CTGAGCCGCG GGCCGGTCCC GGGAGCTCGG CAGGCGCCGC 
AGCACCCTCA GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCCGAGCAG CCCCMCAGC 
CGCAGGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 
TCAACAAGAG CCCGGCCGCT CCCCTTTQGC CGGCCTCCTC GCTCCTCGCC SGAGGCAGCG 
GCAGCCGCCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG CTGCAGCAGC 
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C TCGGCGCTCG CTCGACAGCX: CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 
ATGCCCTCGG CGGCCACCAG GAGGCACCGG AGSGAGAAAG GCGGTCCX3AG GAGCCTCCCA 
TCTCCCTGGA TCTCACCTTC CACCTCCTCC GGGAAGTCTT GGAAATGGCC AGGGCCX3AGC 
AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 
GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 
;CTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 
IGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 

C AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 

TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 
A TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 
^ TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 
: AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 
TAATAAAAGT TGCAAAGGT 



,..„.„„.„^ VI.LVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP P 
ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAH FFRVLLQQLL J 
LPRRSLDSPA ALAERGAHNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL ] 
AQQAHSNRKL MEIIGK 

Seq ID NOi 92 DNA sequence 

Nucleic Acid Accession #: HM_004217.1 

Coding sequence: 58.. 1092 

1 11 21 31 41 51 

GGCCGGCAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG ' 
GCCCAGAAGG AGAACTCCTA CTCCTGGCCC TACGGCGGAC AGACGGCTCC ATCTGGCCTG 
AGCACCCTGC CCCAGCGAGT CCTCfcOGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCIC 
ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 
AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACITTGA GATTGGGCGT 
CCTCTGGGCA AAGGCAAGTT TGGRAACGTG TACTTGGCTC GGGAGRAGAA AAGCCATTTC 
ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 
CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCaACATCCT GCGTCTCTAC 
AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 
CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGAOGAGC AGCKAACAGC CACGATCATG 
GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 
AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 
TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 
CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 
CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 
TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 
GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 
TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAft 
TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 
TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTOTACCT CCTCCTTTGT 
TTAATAAAGG CTGAAGCTTT TTGT 

1 11 21 31 41 51 

111 111 
rUoKEUSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV LMSRSNVQPT AAPGQKVMEN 
^™iLTR HFTIDDFEIG RPLGKGKFGN VYLAREKKSH FIVALKVLFK SQIEKEGVEH 
QL^EIQA hIhHPNILRL YKYFYDRRRI YLILEYAPRG ELYKEUJKSC TFDEQRTATl 

Se^^my chgkkvihrd ikpenlllgl kgelkiadfg wsvhapslrr ktmcgtldyl 

PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHHE TYRRIVKVDL KFPASVPTGA 
QDLISKLLRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL QSVA 

Seq ID HO: 94 DNA sequence 

Nucleic Add Accession #: NM_007019.1 

Coding sequence: 41.. 580 

1 11 21 31 41 51 

LcACGAGCG icTTCCTCTC icTCTGCCAA CGCOSCCCGG ATGGCTTCCC AAAACCGCGA 
CcS^CGCC ACTAGCGTCG CCGCCGCCCG TAAAGGAGCI GAGCCGAGCG GGGGOGCCGC 
CCGGGGTCCG GTGGGCAAAA GGCTACAGCA GGAGCTGATC ACCCTCATGA TOTCTGGCGA 
TAAAGGGATT TCTGCCTTCC CTGAATCAGA CAACCTTTTC AAATGGGTAG QGACCATCCA 
?^GCAGCT GGAACAGTAT ATGAAGACCT GAGGTATAAG CTCTCGCTAG A^TCCCCAG 
TffiCTACCCT TACAATGCGC CCACAGTGAA GTTCCTCRCG CCCTGCTATC ACCCCAACGT 
GGACACCCAG GGTAACATAT GCCTGGACAT CCTGAAGGAA AAGTGGTCTG CCCTGTATGA 
KTCAGGACC ATTCTGCTCT CCATCCAGAG CCTTCTAGGA GAACCCAACA TTGATAGTCC 
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CCAGCCTGIC 600 



GACTCTTTAT CTTGAGCTGT GGTATTTTTG TTTTGTTTTT GTCTTTTAAA T 

r TTTTGTCCTT TTTTAAAAAA AAAAAAAAflA 



15 KWSALYDVRT ILLSIQSLLG EPNIDSPLNT HAAELWKNPT AFKKYLQBTY SKQVTSQEP 

Seq ID NO I 96 DNA sequence 
Nucleic Acid Accession #: AK055663 
Coding sequence! 38.. 1423 

1 11 21 31 41 51 

I I I I I ' 

AGftACGGCTT CCGGOSGGAG CTGTGCAGCT CCTTATCATG GGGACAATTC ATCTCTTTCG 
AAAACCACAA AGATCCTTTT TTGGCAAGTT GTTACX3GGAR TTTAGACTTG TAGCAGCTGA 
25 COSAAGGTCC TGGAflGATAC TGCTCTTTGG TGTAATAAAC TTGATATGTA CTGGCTTCCT 

GCTTATGTGG TGCRGTTCTA CTAATAGTAT AGCTTTAACT GCCTATACTT ACCTGACCAT 
TTTTGATCTT TTTAGTTTAA TGACATGTTT AATAAGTTAC TGGGTAACAT TGAGGAAACC 
TAGCCCTGTC TATTCATTTG GGTTTGAAAG ATTAGAAGTC CTGGCTGTAT TTGCCTCCAC 
AGTCTTGGCA CAGITGGGAG CTCICTTTAT ATTAAAAGAA AGTGCAGRAC GCTTTTTGGA 
ATACACACGG GAAGATTATT AGTTGGTACT TTTGTGGCTC TTTGITTCAA 



ACAGCCCGAG ATACACACGG GAAGATTATT AGTTUOTA1--1- l irijliAji. 1 1. iiiuiii^-rin 
CCTGTTCACG ATGCTTTCTA TTCGGAATAA ACCTTTTGCT TATGTCTCAG AAGCTCCTAG 
TACGACCTGG CTTCAAGAGC ATGTTGCAGA TCTTAGTCGA AGCTTGTGTG GAATTATTCC 
GGGACTTAGC AGTATCTTCC TTCCCCGAAT GAATCCATTT GTTTTGATTG ATCTTGCTGG 
AGCATTTGCT CTTTGTATTA CATATATGCT CATTGAAATT AATAATTATT TTGCCGTAGA 
CACTGCCTCT GCTATAGCTA TTCCCTTGAT GACATTTGGC ACTATGTATC CCATGAGTGT 
GTACAGTGGG AAAGTCTTAC TCCACACAAC ACCACCCCAT GTTATTGGTC AGTTGGACRA 



ACTCATCAGA GAGGTATCTA CCTTAGATGG AGTTTTAGAA GTCCGAAATG Aauitttitu suv. 

GACCCTAGGT TTTGGCTCAT TGGCTGGATC AGTGCATGTA AGAATTCGAC GAGATGCCAA 960 

TGAACAAATG GTTCTTGCTC ATCTGACCAA CAGGCTGTAC ACTCTAGTGT CTACTCTRAC 1020 

TGTTCAAATT TTCAAGGATG ACTGGATTAG GCCTCCCTTA TTGTCTGGGC CTGTTGCAGC 1080 

CAATGTCCTA AACTTTTCAG ATCATCACGT AATCCCAATG CCTCTTTTAA AGGGTACTGA 1140 

TGATTTGAAC CCAGTTACAT C?JLCTCCAGC TAfiACCTAGT AGTCCACCTC CAGAATTTTC 1200 

ATTTAACACT CCTGGGAAAA ATGTGAACCC AGTTATTCTT CTAAACACAC AARCAAGGCC 1260 

TTATGGTTTT GGTCTCAATC ATGGACACAC ACCTTACAGC AGCATGCTTA ATCAAGGACT 1320 

TGGAGTTCCA CGAATTGGAG CAACTCAAGG ATTGAGGACT GGTTTTACAA ATATACCAAG 13 BD 

TAGATATGGA ACTAATAATA GAATTCGACA ACCAAGACCA TGATAGACTC TAACTTATTT 1440 

TTATAAGGAA TATTGACTCC TTGGCTTCCA ATTTATTTAG TAATCCAACT TTGCATTGAC 1500 

TGTTTAATCA TTTACTCTAA ATGTTAGATA ATAGTAGTCT TGTTCACATT TCATGAAACC 1560 

TATGAAACTA TATTnTGTA AAATGTATTT GTGACAGTGfl AATCCTCGTA AATGTTAAAG 1620 

GCTTTAAATA GGCTTCCTTT AGAAAATGTG TTTCTTTAAA TTTGGATTTT GGTATCTTTG 1680 

GTTTTGTAGT TGACIGCAGT GTGATGTGAC CTTACCTTTA TAAGAGCCAC TTGATGGAGT 1740 

AGATCTGTCA CATTACTAAG ATACGATATT TCTTTTTTTT TCCGAGACGG AGTCTTGCTC 1800 

TGCCACTGTG CCCGGCCAAT ACATTATTAT TAACTTAAGG CTGTACTTTA TTAAGGCTTC 1860 

CTTAGTTTTT GTTrTGTTTT GTTTTTTGAG ATGGAGTCTC ACTCTGTCGC CCAGGCTGGA 1920 

ATCCAGTGGC ATGATCTCAG CTCACTGCAA CCTCTGCCTC CTGAGTTCAA ATGATTCTCC 1980 
TGCCTCAQCC TCCCGAGTAG CTGGGATTAC AGGCACCTGC CACCAOGCCu AGCTAATTTT 
TGTATTTTTA GTAAAGACGG GGGATTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG 



TGTATTTTTA GTAAAGACGG GGGATTTCAC CATUTTUUCC IV^'JK.I'^'^ f-i- i^-^^^^^'^ 
ACCTCATGAT CCACCCACCT TAGCCTCCCA AAGTGCTGGG ATTAGGTGTG AGCCACCGCA 2160 
CCTGGCCGAT ATriTCTTTA ATGAAATTTA TAAATATGCT TCTTGAATAA TACACATTTT 2220 



CCTGGCCGAT ATTTTCTTTA ATGAAAITTA Tiwii«iui.i ii-i iv^rini^. 4„w,-„...- 
GGGAAAGGGA AAAATGTCTG TTCAAAAAGT AAAGGTCTCT TTTATAGCTT TTCCAAACTT 
AATTGCTAAA TTTTTCTTTG AGGTTCTCCT GAATTATGTC TTACAAACTA AAAGCAAAAA 
TTTTTAGCAG AAATTTTGGA ATACATTCTA TCTAGCACAA TTTGAATTTT TAATTATCAA 
GATTTTTGTT AAfiGTTTCTC TCCTTTAAAA ATTTTAGTAC ATTTGTAAAT 



rotein Accession it: 



21 

III!'' 

MGTIHLFRKP QRSFFGKLLR EFRLVAADRR SWKILLFGVI NLICTGFLLM WCSSTNSIAL 
TAYTYLTIFD LFSLMTCLIS YWVTLRKPSP VYSFGFERLE VLAVFASTVL AQLGALFILK 
ESAERFLEQP EIHTGRLLVG TFVALCFNLF TMLSIRNKPF ^YVSEAASTS "^EHVADLS 
RSLCGIIEGL SSIFLPRMMP FVLIDLAGAF ALCITYMLIE INNYFAVDTA SAIAIALKTF 
GTMYPMSVYS GKVLLQTTPP HVIGQLDKLI REVSTLDGVL EVBNEHFWTL GFGSLAGSVH 
VRIRRDANEQ MVLAHVTNRL YTLVSTLTVQ IFKDDWIRPA LLSGPVAAHV LNFSDHHVIP 
MPLLKGTDDL NPVTSTPAKP SSPEPEFSFH TPGKNVNPVI LLNTQTRPYG FGLNHGHTPY 
SSMLHQGLGV PGIGATQGLR TGFTNIPSEY GTNNRIGQPE P 
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1 I I I I I 

ATGGACTTAA CCAATGCTTC ACAAATACCfl. AGAAAAAAAT ATTTCAATGT GATftAATATA 
GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGAAAAA 
AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 
CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATSTG AAGAATGTGG CAAAGCCTTC 
AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 
TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTACTGCATA TAAGATAACT 
CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA GTACTCCTCG 
AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCaTC TCTACAAGTG TGAAGAATGT 
GGCAAAGCBT TTAACCSHTGC TGCATGCTTC TTIGTCATTT TGAAGATTTG A 



I I I I I I 

ATGGACTTAA CCAATGCTTC ACRAATACCA AGAAAAAAAT ATTTCAATGT GATAAJ 
GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGi 



GGAAAGTTTT Ti-K i j4ka 1 1 1 i^^^iiwu^ 

AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 
CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATGTG AAGAATGTGG CAAAGCCTTC 
AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 
TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTACTGCATA TAAGATAACT 
CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA Qr^'-^'-'^'^- 
AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCATC TCTACAAGTG T 
IT TTAACCATGC TGCATGCTTC TTTGTCATTT t-^"'"^"'^'''^ = 



Seg ID NO! 100 Protein Sequenc 



MDLTNASQIP RKKYFNVINI GKFFINFQIQ TDIHEDIREK RPFKCIECGK AFNKFSTLTT 
HKKIHTREKP YKCEECGKAF NWSSYLTIHK RIHTGEKLYK CEECGKAFNW SSYLTAYKIT 
HTREKPYKCE ECGKAFKYSS NLTTHKIIHT GEHLYKCEEC GKAFNHAACF FVILKI 



I I I 1 1 1 

GGCACGAGGT CTTGTTATGC AAATAGGCTT CCCACTTGGC AGGGGCCGTC TTGTCCACTC 
GTTTCTGTAA ACATGGGTGG CAAAAAGAGA AGATGGAGCT GCCATTTAGA ACATGCCTAA 
TCCCAGCTTC ATCTTGCTCA GCAAAAATGA AGGAGCCTGG ACCCAACTTT GTTACTGTGA 
GAAAGGGTCT TCATTCATTC AAGATGGCAT TTGTTAAGCA CCTACTGCTG GAGTGCftGTG 
GTTCAATCAC GGATCACTGC AGCCTCCACC TCCCAGTTCA AGRAATTCTC ATGTCTCAGC 
CTCCTGAGCA GCTAGGATTA CAGACAAACC TTGGAAATCA AGRAAGTTCT GGAATGATGA 
AGCTGTTCAT GCCAAGACCG AAAGTGCTGG CCCAGTATGA GTCCATTCAG TTCATGMGT 
GACAATTTTC TTGGAACTCC TTTTTATTGT TAGITCTCAC TTGTTTCCAT ATTTAGTGAA 
TOTMMTTA ATTGCAAAGC TGTCATTAAT AAAAATTCTT ATAGTACCTC ACTGCAC 



MKEPGPNFVT VRKGLHSFKM AFVKHLLLEC SGSITDHCSL HLPVQEILMS QPPEQLGLQT 
NLGNQESSGM MKLFMPRPKV LAQYESIQFM P 



I I I I 1 1 

iGCGGACCGA AgAACGCAGG AAGGGGGCCG GGGGGACCCG CCCCCGGCCG QCCGCAGE?^ 
TCAACTCCAA CGTGGAGAAC CTACCCCCGC ACATCATCCG CCTGGTGTAC AAGGAGGTGA 

Sacactgac cgcagaccca cccgatggca tcaaggtctt tcccaacgag gaggacctca 
ccgacctcca gqtcaccatc gagggccctg aggggacccc atatgctgga ggtctgttcc 
gcatgaaact cctgctgggg aaggacttcc ctgcctcccc acccaagggc tacttcctga 
ccaagatctt ccacccgaac gtgggcgcca atggcgagat ctgcgtcaac gtgctcaaga 

GGGACTGGAC GGCTGAGCTG GGCATCCGAC ACGTACTGCT GACCATCAAG TGCCTGCTGA 
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TCCACCCTAA CCCCGRGTCT GCACTC3iACG 
ACGAGGAGTA TGCGGCTCGG GCCCGTCTGC 
CCAGCGGCAG GGCCGAAGCC GGTCGGGCCC 
ACCCTGGGGC CCCAGGGGGC CCGGGAGGGG 

5 GCGAGCGCGA TAAGRAGCTG GCGGCCAAGA 

CGCTGCGGCG GCTGTAGTGG GCTCTCTTCC 
CCCCTCCCTC CAACICTGTC TCTAAGTTAT 
AGGGGGCACT GGGACCTGGA TTTGTTTTTC 



AGGAGGCGGG CCGCCTGCTC TTGGAGAACT 480 

TCACAGAGAT CCACXSGGGGC GCaSGCGGGC S40 

TGGCCAGTGG CACTGAAGCT TCCTCCACCG 600 

CTGAGGGTCC CATGGCC3UM3 AAGCATGCTG 660 

AAAAGACGGA CAAGAAGCGG GCGCTGCGGG 720 

TCCTTCCaCC GTGACECCAA CCTCTCCTGT 780 

TTAAATTATG GCTGGGGTCG GGGAGGGTAC 840 
TAAATAAAGT TGGAAAAGCA 



MNSNVENLPP HIIRLVYKEV TTLTADPPDG IKVFPNEEDL TDIiQVTIEGP EGTPYAGGI.F 
RMKLLLGKDF PASPPKGyPL TKIFHPNVGA NGEICVNVLK RDWTAELGIR HVLLTIKCLL 
IHPNPESALN EEAGRLLLEN YEEYAAEARL LTEIHGGAGG PSGRAEAGHA LASGTEASST 
DPGAPGGPGG AEGPMAKKHA GERDKKLAAK KKTDKKRALR ALRRL 



1 I I I 1 ' 

CGGCTGAGAG GCAGCGAACT CATCTTTGCC AGTACAGGAG CTTGTGCCGT G 

CCACAGCCCA CAGCCATGGG CTGGGACCTG ACGGTGAAGA TGCTGGCGGG CAACGAATTC 
'C CATGTCGCTG TCAGAGCTGA AGGCGCJiGAT CACCCAGAAG 



ATTGGCGTGC ACGCCTTCCA GCAGCGTCTG GCTGTCCACC CGAGCGGTGT GGCGCTGCAG 
GACAGGGTCC CCCTTGCCAG CCAGGGCCTG GGCCCTGGCA GCACGGTCCT GCTGGTGGTG 
nr-r-nrmiaTi araafiROi-tr! CAGCAGCACC 



GACAGGGTCU ttLTl TtiUUAU uvj»uv~i-v. i u uuv-^-v-^-^™. — — — IZ.ZZrZZ 

GACAAATGCG ACGAACCTCT GAGCATCCTG GTGAGGAATA ACSAGGGCOG CAGCAGCACC 
TACGAGGTCC GGCTGACGCA GACCGTGGCC CACCTGAAGC AGCAAGTGAG CGGGCTGGAG 
GGTGTGCAGG ACGACCTGTT CTGGCTGACC TTCGAGGGGA AGCCCCTGGA GGACCAGCTC 
CCGCTGGGGG AGTACMGCCT CAAGCCCCTG AGCACCGTGT TCATGAATCT GCGCCTGCGG 
GGAGGOGGCA CAGAGCCTGG CGGGCGGAGC TAAGGGCCTC CACCAGCATC CGAGCAGGAT 
CAAGGGCCGG AAATAAAGGC TGTTGTAAGA GAAT 



MGWDLTVKML AGNEFQVSLS SSMSVSELKA QITQKIGVHA FQQRLAVHPS GVALQDRVPL 
ASQGLGPGOT VLLWDKCDB PLSILVRBNK GRSSTYEVRI, TQTVAHLKQQ VSGLEGVQDD 
LFWLTFEGKP LEDQLPLGEY GLKPLSTVFM NLRLRGGGTE PGGRS 

Seq ID NO! 101 DNA sequence 
Nucleic Acid Accession NM_006018 
CoiJlng sequence: 1..1161 

I r T T r r 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACSflGAAGAA CTGCTGTGTG 



ATGAATCGGC ACCATCTGUft uuHiuiuiii i^iuunm^n.^ ™— ~ _ „:"" 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TQGGGCTGGA GTTTATCTTT 
GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA QTCCTGGftAA 
TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TJ^CTACTGAT E^TCTGCCTG 
CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 
CGGCTOGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 
OrGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCIGAA CAAGATCTCC 
AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 
Scc?StGA AGA.GAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 
AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGrTCC TCCTGGAGTT CCTCCTGCCC 
CTGGCCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAftTG 
G&rrGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTClll 
^SiETGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 
TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTI TATCACTCTC 
AGCTTCACCI ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 
TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAQGAAGAT GACAGGTQAG 
CCAGATAATA ACCGCAGCAC GAGOGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 
GCTOAGAGQ OGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 
ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTQGAGAAA 
CAGTTGGGAT GTTGCATCGA G 



wo U3/003906 



PCT/US02/21338 



MNEaJHLQDHF LEIDKKMCCV FRDDFIAKVL PPVLGLEFIF GLLGNQLALH IFCFHLKSWK 60 

VAVDRYFRW HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180 

5 RICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLEQRQM DRHAKIKEAI TFIMWAIVF 240 

VICFLPSWV RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMSNSG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 



I I I I I I 

GGCACSSAGOS AGCGGTGCGG AGGCTCTGCT CGGATCGAGG TCTGCAGOT-C AGCTTCGGGA 60 

GCATGAGTGC TGCAGTGACT GCAGGGAAGC TGGCACGGGC ACCGGCCGAC CCTGGG?JiAG 120 

CCGGGGTCCC CGGAGTTGCA GCTCCCGGAG CTCCGGCGGC GGCTCCACCG GCGAA.'\GAGA 130 

TCCCGGAGGT CCTAGTGGAC CCACGCAGCC GGCGGCGCTA TGTGCGGGGC CQCTTTTTGG 240 

GCRAGGGCGG CTTTGCCAAG TGCTTCGAGA TCTCGGACGC GGACACCAAG GAGGTGTTCG 3 00 

CGGGCAAGAT TGTGCCTAAG TCTCIGCTGC TCAAGCCGCA CCAGAGGGAG A^.GATGTCCA 3 SO 

TGGAAATATC CATTCACCGC AGCCTCGCCC ACCAGCACGT CGTAGGATTC CACGGCTTTT 420 

TCGAGGACAA CGACTTCGTG TTCGrGGTCT TGGAGCTCTG CCGCCGGAGG TCTCTCCTGG 4BQ 

AGCTGCAC3UI GAGGAGGAAA GCCCTGACTG AGCCTGAGGC CCGATACTAC CTACGGCAAA 540 

TTGTGCTTGG CTGCCAGTAC CTGCACOGAA ACCGAGTTAT TCATCGAGAC CTCAAGCTGG 600 

GCAACCTTTT CCTGAATGAR GATCTGGAGG TGAAAATAGG GGATTTTGGA CTGGCAACCA 660 

AAGTCGARTA TCACGGGGAG AGGAAGAAGA CCCTGTGTGG GACTCCTAAT TACATAGCTC 720 

CCGAGQTGCT GAGCAAGAAA GGGCACAt3IT TCGAGGTGGA TGTGTGGTCC ATTGGGTGTA 780 

TCATGTATAC CTTGTTAGTQ GGCBAACCaC CTTTTGAGAC TTCTTGCCTA AAAGAGACCT 840 

ACCTCCGGAT CAAGAAGAAT GAATACAGTA TTCCCAAGCA CATCAACCCC GTGGCCGCCT 900 

CCCTCATCCA GAAGATGCTT CAGACAGATC CCACTGCCOG CCCAACCATT AACBAGCTGC 960 

TTAATGACGA GTTCITTACT TCIGGCTATA TCCCTGCCCG TCTCCCXaiTC ACCTGCCTGA 1020 

CCATTCCACC AAGGTTTTCG ATTGCTCCCA GCAGCCTGGA CCCCAGCAAC CGGAAGCCCC 1080 

TCACAGTCCT CAATAAAGGC TTGGAGAACC CCCTGCCTGA GCGTCCCCGG GAAAAAGAAG 1140 

AACCAGTGGT TCGAGAGACA GGTGAGGTGG TCGACTGCCA CCTCAGTGAC ATGCTGCAGC 1200 

AGCTGCACAG TGTCAATGCC TCCAAGCCCT CGGAGCGTGG GCTGGTCAGG CAAGAGGAGG 1260 

CTGAGGATCC TOCCTGCATC CCCATCTTCT GGGTCAGCAA GTGGGTGGAC TATTCGGACA 1320 

AGTACGGCCT TGGGTATCAG CTCTCTCATA ACAGCGTGGG GGTGCTCTTC AATGACTCAA 1380 

CACGCCTCAT CCTCTACAAT GATGGTGACA GCCTQCAGTA CATAGAGCGT GACX3GCACTG 1440 

AGTCCTACCT CACCGTGAGT TCCCATCCCA ACTCCTTGAT GAAGAAGATC ACCCTCCTTA 1500 

AATATTTCCG CAATTACATG AGCGACCACT TGCTGAAGGC AGGTGCCAAC ATCACGCCGC 1560 

GCGAAGGTGA TGAGCTCGCC CGGCTGCCCT ACCTACGGAC CTGGTTCCGC ACCCGCAGCG 1620 
CCATCATCCT CCACCTCAGC AACGGCAGCG TGCAGATCAA CTTCTTCCAG GATCACAEE^ 

AGCTCATCTT GTGCCCACTG ATGGCAGCCG TGACCTACAT CGACGAGAAG CGGGACTTCC 1740 
aCACATACCG CCTGAGTCTC CTGGAGGAGT ACGGCTGCTG CAAGGAGCTC GCCAGCCGGC 
TCCGCTACCC CCGCACTATG GTGGACAAGC TGCTGAGCTC ACGCTCGGCC ASCAACCGTC 
TCAAGGCCTC CTAATAGCTG CCCTCCCCTC CGGACTGGTG CCCTCCTCAC TCCCACCTGC 
ATCTGGGGCC CATACTGGTT GGCTCCCGCG GTGCCATGTC TGCAGTGTGC CCCCCAGCCC 

ccr-aacTGG GCAGAGCTGC ATCATCCTTG CAGGTGGGQG TTGCTGTGTA AGTTAITITT — - 

GTACATGTTC GGGTGTGGGT TCTACAGCCT TGTCCCCCTC CCCCTCAACC CCACCATATG 2100 

AATTGTACAG AATATTTCTA TTGAATTCX3G AACTGTCCTT TCCTTGGCTT TATGCftCATT 2160 
AAACAGATOT GAATATTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



1740 
1800 

1920 
1980 
2040 



MSAAVTAQKL ARAPADPGKA GVPGVAAPGA PAAAPPAKEI PEVLVDPRSR RRYVRGRFLO 
KGGFAKCFEI SDADTKEVFA GKIVPKSLLL KPHQREKMSM EISIHRSLAH QHWGFHGFF 
EDNDFVFWL ELCKRRSLLE LHKKRKALTE PEARYYLRQI VLGCQYLHRN RVIHRDLKLG 
HLFLNBDLEV KIGDFGIATK VEYDGERKKT LCGTPNYIAP EVLSKKGHSF EVDVWSIGCI 
MYTLLVGKPP FETSCLKETY LRIKKNEYSI PKHINPVAAS LIQKMLQTDP TARPTINELL 
HDEFFTSGYI PARLPITCLT IPPRFSIAPS SLDPSNRKPL TVLNKGLENP LP=:RPR=.KEE 
PWRETGEW DCHLSDMLQQ LHSVNASKPS ERGLVRQEEA 3DPACIPIFH VSKKVDYSDK 
YGLGYQLCDN SVGVLFNDST RLILYNDGDS LQYIERDGTE 3YLTVSSHPN SLMKKITLLK 
YFENVMSEHL LKBGANITPR EGDELftRLPY LRTWFRTRSA IILKLSNGSV QINFFQDHTK 
LILCPLMAAV TYIDEKRDFR TYRLSLLEEY GCCKELASEL RYARTMVDKL LSSRSASNRL 



TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAQAAGAQ CAQCAAAGCT GAAGTAGCAG 
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C AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 
TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC C 
TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG C 

rGACAAA ATAGAAGTGA TTATTACCCT G 
\ TCCCAAATCG AAGCAAGCAA GGCTTATAAT C 
rCAAAAC ATATGAAGTC CTGGAAAAGG G 
: AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG T 
r ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG G 

3 ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT C 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA .AA.GCTACCTT A 
GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC T^"^^^]^^^' TT"TGACTAT 
GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 
CTGTGGTTAC AGTGGAGACA TTQACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 
CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT QCAA.2iTACAC ACTTCTTTCC 
CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATGA TTTGGTTTGT 
TTTATAACCR ATTCATTJiAA TGTAATTCAT AAAATGTACT ATGAAAAAA^. TTATACGCTA 
TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAQCACCG GTCTT.^A.TTT 
GATGTTTTTC AACTTTTATT CATTGAGATG T 



.GTACITTTT GTTTTGRTCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 
TACAAAATGT TTTTGTCTAC CAAAGAARAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 
AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 
TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 
i ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 



40 1 I I 1 1 I 

GAATTCGAGG ATCCGGGTAC CATGGGCGGC GGCAGGCCTA GCAGCACGGG AACCGTCCCC i 
CGCGCGCATG CGCGCGCCCC TGAAGCGCCT GGGGGAOSGG TATGGGCGGG AGGTAGGGGC 
GCGGCTCCGC GTGCCAGTTG GGTGCCCGCG CGTCACGTGG TGAGGAAGGA GGCGGAGGTC 
TGAGTTTCGA GGGAGGGGCG GAGAGAAGAG GGAACGAGCA AGGGAAGGAA AGCGGGGAAA 
45 GGAGGAAGGA AACGAACGAG GGGGAGGGAG GTCCCTGTTT TGGAGGAGCT AGGAGCGTTG 
CCGGCCCCTG AAGTGGAGCG AGAGGGAGGT GCTTCGCCGT TTCTCCTGCC AGGGGAGGTC 
CCGGCTTCCC GTGGAGGCTC CGGACCAAGC CCCTTCAGCT TCTCCCTCCG GATCGATGTG 
CTGCTGTTAA CCCGTGAGGA GGCGGCGGCG GCGGCAGCGG CAGCGGAAGA TGGTGTTGCT 
GAGAGTGTTA ATTCTGCTCC TCTCCTGGGC GGCGGGGATG GGAGGTCAGT ATGGGAATCC 
50 TTTAAATAAA TATATCAGAC ATTATGAAGG ATTATCTTAC AATGTGGATT CATTACACCA 
ni^CMG CGTGCCAAAA GAGCAGTCTC ACATGAAGAC CAATTTTTAC GTCTAGATTI 
CCATGCCCAT GGAAGACATT TCAACCTACG AATGAAGAGG GACACTTCCC TTTTCAGTGA 
TGAATTTAAA GTAGAAACAT CAAATAAAGT ACTTGATTAT GATACCTCTC ATATTIACAC 
TGGACATATT TATGaiGAAG AAGGAAGTTT TAGCCATGGG TCTGTTATTG ATGG.AAGATT 
55 TGAAGGATTC ATCCAGACTC GTGGTGGCAC ATTTTATGTT GAGCCAGCAG AGAGATATAT 
TAAAGACCGA ACTCTGCCAT TTCACTCTQT CATTTATCAT GAAGATGATA TTAACTATCC 
CCATAAATAC GGTCCTCAGG GGGGCTGTGC AGATCATTCA GTATTTGAAA GAATGAGGAA 
ATACCSIGATG ACTGGTGTAG AGGAAGTAAC ACAGATACCT CAAGAAGAAC ATGCTGCTAA 
TGGTCCAGAA CTTCTGAGGA AAAAACGTAC AACTTCAGCI GAAAAAAATA CTTGTCAGCT 
60 TTRTATTCAQ ACTGATCATT TGTTCTTTAA ATATTACGGA ACACGAGAAG CTGTGATTGC 
CCSGATATCC AGTC»TGTTA AAGCGATTGA TACAATTTAC CAGACCACAG ACTTCTCCGG 
AATCCGTAAC ATCAGTTTCA TGGTGAPACG CATAAGAATC AATACAACTG CTGATGAGAA 
■ GGACCCTACA AATCCTTTCC GTTTCCCAAA TATTGGTGTG GAGAAGTTTC TGGAATTGAA 
TTCTGAGCAG AATC^TGATG ACTACTGTTT GGCCTATGTC TTCACAGACC GAGATTTTGA 
65 TGATGGCGTA CTTGGTCTGG CTTGGGTTGG AGCACCTTCA GGAAGCTCTG GAGGAATATG 
TGAAAAAAQT AAACTCTATT CAGATGGTAA GAAGAAGTCC TTAAACACTG GAATTATTAC 
TGTTCAGAAC TATGGGTCTC ATGTACCTCC CAAAGTCICT CACATTACTT TTGCTCACGA 
AGTTGGACAT AACTTTGGAT CCCCACATGA TTCTGGAACA GAGTGCACAC CAGGAGAATC 
TAAGAATTTQ GGTCAAAAAG AAAATGGCAA TTACATCATG TATGCAAGAG CAACATCTGG 
70 GGACAAACTT AACAACAATA AATTCTCACT CTGTAGTATT AGAAATATM GCCAAGTTCT 
TGAGAAGAAG AGAAACAACT GTTTTGTTGA ATCTGGCCAA CCTATTTGTG GAAATGGAAT 
GGTAGAACAA GGTGAAGRAT GTGATTGTGG CTATAGIGAC CAGTGTAAAG ATGAATGCTG 
CTTCGATGCA AATCAACXU«3 AGGGAAGAAA ATGCAAACTG AAACCTGGGA AACAGTGCAG 
TCCAAGTCAA GGTCCTTGTT GTACAQCACA GTGTGCATTC AAGTCAAAGT CTGAGAAGTG 
75 TCGGGATGAT TCAGACTGTG CSAGGGAAGG AATATGTAAT GGCTTCACAG CTCTCTGCCC 
AGCATCTGAC CCTAAACCAA ACTTCACAGA CTGTAATAGG CATACACAAG TGTGCATTAA 
TGGGCAATGT GCAGGTTCTA TCTGTGAGAA ATATGGCTTA GAGGAGTGTA CGTGTGCCAG 
TTCTGATGGC AAAGATGATA AAGAATTATG CCATSTATGC TGTATGAAGA AAATGGACCC 
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ATCAACTTGT GCCAGTACflG GGTCTGTGCA GTGGAGTAGG CACTTCAGTG GTCX3AACCAT 2340 

CACCCTGCAA CCTGGATCCC CTTGCAACGA TTTTAGAGGT TACTGTGATG TTTTCATGCG 2400 

GTGCAGATTA GTAGATGCTG ATGGTCCTCT AGCTAGQCTT AAAAAAGCAA TTTTTAGTCC 2460 

AGAGCTCTAT GAAAACATTG CTGAATGGAT TGTGGCTCAT TGGTGGGCaG TATTACTTAT 2520 

5 GGGAATTQCT CTGATCATGC TAATGGCTGG ATTTATTAAG ATATGCAGTG TTCATACTCC 2580 

AAGTAGTAAT CCAAAGTTGC CTCCTCCTAA ACCACTTCCA GGC3^CTTTAA AGAGGAGGAG 2640 

ACCTCCACAG CCCflTTCAGC AACCCCAGCG TCAGCGGCCC CGAGAGAGTT ATCAAATGGG 2700 

ACACATGAGA CGCTAACTGC AGCTTTTGCC TTGGTTCTTC CTAGTGCCIA CAATGGGAAA 2760 

ACTTCACTCC AAAGAGAAAC CTATTAAQTC ATCATCTCCA AACTAAACCC TCACAAGTAA 2B20 

10 CAGTTGAASA AAAAATGGCA AGAGATCATA TCCTCAGACC AGGTGGAATT ACTTAAATTT 2B80 

TAAAGCCTGA AAATTCCAAT TTGGGGGTGG GAGGTGGAAA AGGAACCCAA TTTTCTTATG 2940 

AACAGATATT TTTAACTTAA TGGCACAAAG TCTTAGAATA TTATTATGTG CCCCGTGTTC 3000 

CCTGTTCTTC GTTGCTGCAT TTTCTTCACT TGCAGGCAAA CTTGGCTCTC AATAAACTTT 3060 

TACCACAAAT TGAAATAAAT ATATTTTTTT CAACTGCCAA TCAAGGCTAG GAGGCTCGAC 3120 

15 CACCTCAACA TTGGAGACAT CACTTGCCAA TGTACATACC TTGTTATATG CAGACATGTA 3180 

TTTCTTACGT ACACTGTACT TCTGTGTGCA ATTGTAAACA GAAATTGCAA TATGGATGTT 3240 

TCTTTGTATT ATAAAATTTT TCCGCTCTTA ATTAAAAATT ACTGTTTAAT TGACATACTC 3300 

AGGATAACAG AGAATGGTGG TATTCAGTGG TCCAGGATTC TGTAATGCTT TACACAGGCA 3360 
GTTTTGAAAT GAAAATCAAT TTACCCCATG GTACCCGGAT CCTCGAATTC 

20 



25 1 11 I II 

MVLLRVLILL LSWAAGMGGQ YGNPLNKVIR HYEGLSYNVD SLHQKHQRAK RAVSHEDQFL 
RLDFHAHGRH ENLRMKKDTS LFSDEFKVET SNKVLDYDTS HIYTGHIYGE EGSFSHGSVI 
DGRFEGFIQT HGGrTFYVEPA ERYIKDRTLP FHSVIYHEDD INYPHKYGPQ GGCADHSVFE 
RMRKYQMTGV EEVTQIPQEE HAANGPELLR KKRTTSABKM TCQLYIQTDH LFFKYYGTRE 

30 AVIAQISSHV KAIDTIYQTT DFSGIRNISF MVKRIRINTT ADEKDPTNPF RFPNIGVEKF 
LELNSBQNHD DYCLAYVFTD RDFDDGVLGL AWVGAPSGSS GGICEKSKLY SDGKKKSLNT 
GIITVQNYGS HVPPKVSHIT FAHEVGHHFG SPHDSGTECT PGESKNLGQK ENGNYIMYAR 
ATSGDKLMNN KFSLCSIRNI SQVLEKKENN CFVESGQPIC GNGMVEQGEE CDCGYSDQCK 
DECCFDAMQP EGRKCKLKPG KQCSPSQGPC CTAQCAFKSK SBKCRDDSDC AREGICNGFT 

3 5 ALCPASDPKP NFTDCNRHTQ VCINGQCAGS ICEKYGLEEC TCASSDGKDD KELCHVCCWK 

KMDPSTCAST GSVQWSEHFS GRTITLQPGS PCNDFRGYCD VEMRCRLVDA DGPLARLKKR. 
IFSPELYENI AEWIVAHWWA VLLMGIAI.IM LMAQFIKICS VHTPSSNPKL PPPKPLPGTL 
KRRRPPQPIQ QPQRQRPHES YQMGHMRR 

40 Seq ID NO! 115 DNA sequence 

Nucleic Acid Accession « ! NM_000S77.2 
Coding sequence: 41..520 

1 11 21 31 41 51 

45 1 I ! I 1 1 

GGCRCGAGGG GRAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 
CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 
GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCMG GACCAAATGT 
CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 
50 CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 
GGAGGCAGTT AACATCACTG ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTOSCCTT 
CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 
CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 
CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 
55 TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCCG 
GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 
CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 
GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 
TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 
60 CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 
ACCAAGTGGC TCCCACACCC TGTTTTACAA RAARGAAAAG ACCAGTCCAT GAGGGAGGTT 
TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT tTTTTTTTTT CAGTCCCCGT 
GAAGGAGAGC CCTTCATTTC GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 
ATTCCTGCAI TTGTGAAATG ATGGTGfiAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 
65 TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 
ATTTTTTTTI TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 
CCCAQCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 
TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCI GGGGGTTCTT 
TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 
70 ACTTQTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 
GAGCAGGAAA CATGACTCMT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 
CTCTTGGCAG GTRCTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 
CTQTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAA 
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S GRKSSKMQAF RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 
HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSBNRK QDKRFAFIRS DSGPTTSFES 
r AMEADQPVSL TNMPDEGVMV TKFYFQEDE 



GCACTTTA3C TCAGTGATGG ATTTTAAGAA CGATTACACC TGTCGCCTGT G 
CAGGCACTCG CGTCAGGTAC TTCTGCTCCA GGATAGCTTT ATGAATTGCT C 

15 CATCAATGGT TCCTTTCGTG CGCTTGGCTT TATTCATGAG GCTCAGGTCG G 

GATGGTCCAC TGTGACAGCA AGACAGGTAA TGCAAATACG GATTTCATCT GGGTGGGTCC : 
AGATAACAGA CTGCTAGAGC CGGATAAAGA GATGGAAAAC TTTTACGTGT TTCACRATGG 3 
AAGTCTGGTT ATAGAAAGCC CTCGTTTTGA GGATGCTGGA GTGTATTCTT GTATCGCAAT 3 
GAATAAGCAA CGCCTGTTAA ATGAAACTGT GGACGTCACA ATAAATGTGA GCAATTTCAC 

20 TGTAftGCAQA TCCCATGCTC ATGAGGCATT TAACACAGCT TTTACCACTC TTGCTGCTTG < 
CGTGGCCAQT RTCGTTTTGG TACTTTIGTA CCTCTATCTQ ACTCCATGCC CCTGCAAGTG E 
TAAAACCAAG AGACAGAAAA ATATGCTACA CCAAAGCAAT GCCCATTCAT CGATICTCAG f 
TCCTGGCCCC GCTAGTGATG CCTC03CTGA TGAAOGGAAG GCAGGTGCAG GTAAAAGAGT f 
GGTGTTTTTG GAACCCCTGA AGGATACTGC AGCAGGGCAG AACGGGAAAG TCAGGCTCTT 

25 TCCCAGCtSAG GCAGIGATAG CTGAGGGCRT CCTAARGTCC ACGAGGGGGA AATCTGACTC 
AGATTCAQTC AATTCAGTGT TTTCTGACAC ACCTTTTGTG GOGTCCACTT AA 

Seq ID NO! 118 DNA sequence 
Nucleic Acid Accession Hi AB032989.1 
30 coding sequence: 1..1316 

LcATTCCTT LcCSGTTAC ACAGCACTAC TGGACCTCAG TCACAACAAC CTGAGCCGCC 
35 TGCGGGCCGA QTGGACCCCC AOSCGCCTGA CCC3U«:TGCA CTCCCTGCTG CTGAGCCACA 
ACCACCTGAA CTTCATCTCC TCTGAGGCCI TTTCCCCGGT ACCCAACCTG CGCTACCTGG 
ACCTCTCCTC CAACCAGCTG OGTACRCTGG ATGAGTTCCT GTTCAGTGAC CTGCSiAGTAC 
TGGAGGTGCT GCTGCTCTAC AATAACCACft TCRTGGCGGT GGACCGGTGC GCCTTCQATG 
ACATGGCCCA GCTGCAGAAA CTCTACTTGA GCCXGAACCA GATCTCTCGC TTCCCTCTGG 
40 AACTGGTCAA GGAAGGAGCC AAGCTACCCA AACTAACGCT CCTGGATCTC TCTTCTAACA 
AGCTGAAGAA CTTGCCATTG CCTGACCTGC AGRAGCTGCC GGCCTGGATC AAGAATGGGC 
TGTACCTACA TAACAACCCC CTGAACTGCX! ACTGTGAGCT CTACCAGCTG TTTTCACACT 
GGCAGTATCG GCAGCTGAGC TCCQTGATGG ACTTTCS^AGA GGATCTGTAC TGCATGAACT 
CCAAQAAGCT GCACAATGTC TTCAACCTGA GTTTCCTCAA CTGTGGCGAG TACAAGGAGC 
45 GTGCCTGGGA GGCCCACCTQ GGTGACACCT TGATCATCAA GTGTGACACC AAGCAGCAAG 
GGATGACCAA GGTGTGGQTG ACACCAAOTA ATGAACGGGT GCTAGATGAG GTGACCAATG 
QCACAGTGAG TGTGTCTAAG GATGGCAGTC TTCTTTTCCA GCAGGTGCAG GTOGAGGACG 
GTGGTGTGTA TACCTQCTAT GCCATGGGAG AGACTTTCAA TGAGACACTG TCTGTGGAAT 
TGAAAGTGCA CAATTTCACC TTGCACGGAC ACCATGACAC CCTCAACACA GCCTATACCA 
50 CCCTAGTGGG CTGTATCCTT AGTGTGGTCC TGGTCCTCAT ATACCTATAC CTCACCCCTT 
GCCGCTGCTG GTaCCGGGGT GTAGAGAAGC CTTCCAGCCA TCAAGGAGAC AGCCTCAGCT 
CTTCCATQCT TAQTACCACA CCCAACCATQ ATCCTATGGC 7GGTGGGGAC AAAGATGATG 
GTTTTGACCG GCGGGTGGCT TTCCTQQAAC CTGCTGGACC TGGGCAGGGT CAAAACGGCA 
AGCTCAAQCC AGGCAACACC CTGCCAGTGC CTGA6GCCAC AGGCAAGGGC CAAOGGAGGA 
55 TGTCGGATCC AGAATCAGTC AGCTCGGTCT TCTCTGATAC GCCCATTGTG GTGTGAGCAG 
GATGGGTTGG TGGGGAGATT CTGCCCCAGG AGAGGTAATG CACCCCTGAA GGATATGAGG 
GGATGGAAGA GAGGGCTGGC TGCCCAAGGG AATGGATTCC TCCTGACCTC AAGGGAATTG 
GTCCCAGGTA CAACAGAQAG CAAGACCCCA AACAGGGCGT GGCTGCCACG ATTTTCAAAT 
GGGGGTATAT TAATCCTCAG GCAAATGCTA CACCCGTACC CAAAGCCCTG CCAATTCTCA 
60 GTGTGGTAGA GGAAGAGAAG CATGTCTGAG TTGGATGGGA ATGAGGAAGG AGTGAGGGGA 
AGAACAGATT CCCTAAACTT TCATGAGGTC ATCCCTAGCT CCTTAAGAGA AAAACATTTA 
GAAAAAAAAA TTTTTTCTAT CTCTGCCCAC CCACACCTGT GATGTTGTGT GTGTTGGGGG 
AQCTTCCACA GGAGCCACAC TGGGGAAAAG CTGTAGTATC TTCTTTGTTT AGCAAGTCAA 
ACTTCACAGC TQGGGAAGTT GAAAACCTTT GGAAAATGAG TCAGGAAAAG GCIGAGACCT 
65 CAATCGGTAA CAATTCCCCA AAGCIGTTGT AAGACTTTGC ATTAAAGCCT TTCTTTTTCC 
TGATGTGCCC TGGGTCGATG GATCITTGCA GGAGCCTGGC CTGCTGCCTT TTAGTTACAA 
CAGCAATGTG GQATGTGGTA TCTCTGCCTT GTCCTGGGAC CAATGGCACA GAGGGTTATA 
GCATGAACTG AAGGGTCCAT GTCACAGCAT AGGTGCCAGC ACGAAACTTG TAGATGGGGG 
GTGAATTCCA GATAGGCTAA TTCCCTGGTT GGTGAAACCC TTGGTTAGTA TTTGATTCCC 
70 AGCATCTGTG TGTCAGGCCA GAGTGGGGGC TGCTGAAGGC AGAGCTTTTA TQGATGCCTG 
CCACAGACGT GGTACATGGG CCCACTTCAC GATAG.»AGGA TGAAGGTAAA TCAGCCTCTA 
TTATGTAGCA TCCAGCTGGA CCTAGCTGAT TCTCAATGTA TGCCATCTCT TCATCCAGTC 
TCCTTATO^ GCTCAGGTGG GAITCTGAAT TTTCCCTAAT ATGCTGCTCT GTAAAGGGAG 
GAGTCCAAGA AAGCCTGGCC CCAGCTGGCT TAGAGTAGGC TTGGCGATAG TGAAAATCCA 
75 GTATATATCT CCTCTGTGTG GACAAGGTTG CCTTQCTCCT CTCAGAGATA CCACCTCTGC 
TGCAAAGTCA AAGCATGGGG TTTTGACATC AGAAGGGTTG GAATTGGGTG GTCTCTACCC 
CAGAATGGCA ATTTGGAGTG CAGCTTCRAG TTCTGCCCTC TCCACAGATT CCAGGCAATC 
CTGAAGCAGA GATAATCAGT AGTACTAAAC TTTCAGAAGG GGAGTTCTGC CACCCCATTC 
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S AGGTCTTTGC AGTTGTGGTT GTGftGGCCftG AGGAACTITAA GAAGAATGRG 2760 

CrCAACRTTC CATTCCATCC AAGGTTGCAG AATACTTTGA GTATGGftGAT GACCGATCCC 2820 

TTTAACETTT CCCITTGACT ATACTGCATG CCCTGTGAGT GGCTGCTICTG AGCTTTGGCG 2 8B0 

GGGGATGAGC AGGGTGAAGG TGAAAATGTT CTCTGATGTT GGTTTTTCTT AGTGTCTCTT 2940 

CTCTCCCACG TTTCTAGTCC ATTGGAAGTT CTCCAGATCC TCTAGATTTC TGACCTCTTT 3000 

TTACATTCTA AGAGGTGTCA TTTACCTGTG GGTGAGGGGC TTGGAGGGCC TGACTATTAA 3060 

GCTTCOIATT ATGATTGGGG AAGGGGACTC AACATCTTCC CCTTGTCTGG TCCIGGGAAC 3120 

TCAAGACTGC TGTCCATCCT TTATTAGTAA CCTAGTTTTG TTGAGAGAGA GGTGCAGCAT 3180 

TTCCCTCAAG TACCTGCATG CTCATAGGTA TGCATGCTTA TAGGTATGCA TGCaCACCAG 3240 

CACCTTGACT TCCACCAGGA CAGTGAATCT GTAGTCOCCA TTAATGATAC TCTCAGAATG 3300 

GTCTCCACCA ATGTGCACAA CTTCATGCAC ACAATTGIAT TCCTCTTTAG CjAGAGACAGT 3360 

GACCATGCCA GGCTCTTCAA ATACACACTC CTGTATITTC ATTTCTGAAC TCCCTGAAGC 3420 

TTCACCCACA TCTCCATGGC ACTGAATOiC TTTTCTGCCC AGGCTGGAGT TAGAAGAACC 3480 

ATGGGTTGAA GGCCTTGGGA GAAGAGGTGG TAAGGAGCTA CCTGGTTCTT TQGACTCCTC 3540 

AGAAACCTCC CGTGAAGACT ATATATATGA GCAATATATG AGCAATTAAG ATAGAACTAG 3600 

GTCATTGAAG GACTCAGAAG GAAACTGGCA AGGGGCTRAG TAGCTGTGAG CTGCTCTICC 3660 

TGTTAACTGT CCTCAGGAGA GAAAAGTTGT CTTGATTTAT TCCAGAGACT CCTGAGAATT 3720 

TTGTTACTGG AGCTATAGGA ACTAGGACTC CCTCCTCAGA CTCATCAGGA AATCTGGGAA 3780 

AAAGTGAGCG TTAAATGAAG CAGGCCTTCT GGQaCATCGT GATGGAGCTC CTGAGGACTG 3900 

TCTTAAGTCT CCTGACTATA TCCACTAGTT TGGCCTTGAG TTTGAGACAC CTGGTAGATG 4020 

TACCTGGCTG CCCTACCAAG AGATTTGATT TGTTTCTTCC CGTGGTTTCA GGTGATAGAA 40S0 

CCCAAAGGTA GCTTTTTAAT TGGAGAATTG AATAATCAGC ATATGTAAGC GCACTAGAAC 4140 

CCTGTGTTGA AAACTGCCAG GTGTGGGTAT AAGAA.^AGGC CGAGAGATCA CCACCTCCTC 4200 

TTACCCTACC CCACACCCAT AAACCAGACA TGTCTCCCAG GAAGCAGGTG TCCCTGGACA 4260 



3 TACACCCCAG C 
r TCTTACCTCT GTACTTCCTC GTAGCTCTGC TGACAAAGCA 
:CACTTG TGTACACCTA GGCTGGCAAA 
TCTTGGAGCC TCTGGGCTCT GAAAACTAGA CAATGATCAT TAAACCTGGC TTGAGTCTCT 
GTTCTGGC 

Seq ID NOi 119 Protein Sequence 
35 Protein Accession H: BAAB6477.1 

1 11 21 31 41 51 

I I I I I 1 

HSLPSYTALL DLSHNNLSRL RAEWTPTRLT QLHSLLLSHN HLNPISSEAF SPVPNLRYLD 

40 LSSNQLRTLD EFLFSDLQVL EVLLLYNNHI MAVDRCAFDD MAQLQKLYLS QNQISRFPLE 
LVKEGAKLPK LTLLDLSSNK LKNLPLPDLQ KLPAWIKNGL YLHNNPLHCD CELYQLFSHW 
QYRQLSSVMD FQEDLYCMHS KKLHNVFNLS FLNCSEYKER AWEAHLGDTL IIKCDTKQQG 
MTKVWVTPSN ERVLDEVTNG TVSVSKDGSL LFQQVQVEDG GVYTCYAMGE TFNETLSVEL 
KVHNPTLHGH HDTLHTAYTT LVGCILSWL VLIYLVLTPC RCWCRGVEKP SSHQGDSLSS 

45 SMLSTTPNHD PMAGGDKDDG FDRRVAFLEP AGPGQGQNGK LKPGNTLPVP 3ATGKGQRRM 
SDPESVSSVF SDTPIW 

Seq ID NO: 120 DMA sequence 
Nucleic Add Accession #i NM_005071 
50 Coding sequencei 9.. 1703 

1 11 21 31 41 51 

GATAGACCAT GAGCAGCCAT GGCRACAGCC TGTTCCTTCG GGAGAGCGGC CAGCGGCTGG 

55 GCCGGGTGGG CTGGCTGCAG CGGCTGCAGG AAAGCCTGCA GCMAGAGCA CTGCGCACGC 
GCCTGCGCCT GCAGACCATG ACCCTCGAGC ACGTGCTGCG CTTCCTGCGC CGAAACGCCT 
TCATTCTGCT GACGGTCAGC GCCGTGGTCA TTGGGGTCAG CCTGGCCTTT GCCCTGCGCC 
CATATCAGCT CACCTACCGC CAGATCAAGT ACTTCTCTTT TCCTGGAGAG CTTCTGATGA 
GGATGCTGCA GATGCTGGTG TTACCTCTCA TTGTCTCCAG CCTGGTCACA GGTATGGCAT 

60 CCCTGGACAA CAAGGCCACG GGGCGGATGG GGATGCGGGC AGCTGTGTAC TACATGGTGA 
CCACCATCAT CGCGGTCTTC ATCGGCATCC TCATGGTCAC CATCATCCAT CCCGGGAAGG 
GCTCCAAGGA GGGGCTGCAC CGGGAGGGCC GGATCGAGAC CATCCCCACA GCTGATGCCT 
TCATGGACCT GATCAGAAAT ATGTTTCCAC CAAACCTTCT GGAGGCCTGC TTCAAACAGT 
TCAAGACGCA GTACAGCACG AGGGTGGTAA CCAGGACCAT GGTGAGGACA GAGAACGGGT 

65 CTCAGCCGGG TGCCICCATG CCTCCICCAT TCTCAGTGGA GAACGGAACC AGCTICCTGG 
AAAATGTCAC TCGGGCCTTG GGTACCCTGC AGGAGATGCT GAGCTTTGAG GAGACTGTAC 
CCGTGCCTGG CTCCGCCAAT GGCATCAACG CCCTGGGCCT CGTGGTCTTC TCTGTTGGCCT 
TTCGGCTGGT CATTGGTGGC ATGAAACACA AGGGCAGAGT CCTCAGGGAC TTCTTCGACA 
GCCTCAATGA GGCTATTATG AGGCTGGTGG GCATCATTAT CTGGTATGCA CCTGTGGGCA 

70 TCCTGTTCCT GATTGCTGGG AAGATTCTGG AGATGGAAGA CATGGCCGTC CTGGGGGGTC 
AGCTGGGCAT GTACACCCTG ACCGTCATCG TGGGCCTGTT CCTCCATGCC GGCATTGTCC 
TTCCCCTCAT GTACTTCCTC GTCACTCACC GGAACCCCTT CCCCTTCATT GGGGGCATGC 
TACAAGCCCT CATCACCGCT ATGGGCACGT CTTCCAGCTC GGCAACGCTG CCCftTCACCT 
TCCGCTGCCT GGAGGAGGGC CTGGGTGTGG ACCGCCGCAT CACCAGGTTC GTCCTGCCCG 

75 TGGGCGCCAC GGTCAACATG GATGGCACTG CCCTCTACGA GGCCCTGGCT GCCATCTTCA 
TTGCTCAAGT TAACAACTAC GAGCTCAACC TGGGTCAGAT C3«:AACCATC AGCATCACGG 

ccacagcagc cagtgttggg gctgctggca tcccccaggc gggtctggtc accatggtca 
ttgtgcttac gtciggtcggc ttgcccacbg aagacatcac gctcatcatc gccgtgqact 
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GGTTCCTTGA CCGGCTTCGC ACAATGACCA ACGTACTGGG GGACTCAATT GGAGCGGCCG 
TCATCGRGCA CTTGTCTCAG CGGGAGCTGG ftQCTTCAGGA AQCTGAGCTT ACCCICCCCA 
» ACCCTACAAG TCCCTCATGG CACAGGAGAA GGGGGCRTCC CGGGGACGGG 
A GAGTGCTATG TGAGGGGCCT CCAGCTCTG 



I I I I 1 

MSSHGNSLFL RESGQRLGRV GWLQRLQESL QQRALRTRLR LCTMTLEHVL 
LTVSAWIGV SLAFALRPYQ LTYRQIKYFS FPGELLMRKL QKLVLPL:V3 
NKATGRMGMR AAVYYMVTTI lAVFIGILMV TIIHPGKGSf; SGLHREGRIS 
lilRNMFPPNL VEACFKQFKT QYSTRWTRT MVRTENGSEP GASMPPPPSV 
TRALGTLQEM LSFEETVPVP GSANGINALG LWFSVAFGL VIGGMKHKGR VLRDFFDSLN 300 
EAIMRLVGII IWYAPVGILF LIAGKILEMB DMAVLGGQLG MYTLTVIVGL FIiHAGIVIiPL 360 
lYFLVTHRNP FPFIGGMLQA LITAMGTSSS SATLPITFRC LEEGLGVDRR ITRFVLPVGA 420 
TVNMDGTALY EALRAIFIAQ VNNYELNLGQ ITTISITATA ASVGAAGIPQ AGLVTMVIVL 480 
[i GDSIGAAVIE HLSQRELELQ EAELTLPSLG 540 



Seq ID NO: 122 DNA sequence 

Nucleic Acid Accession #i NM_031966.1 

Coding sequence: 178.. 1479 

1 11 21 31 41 51 

I I I I I 

ACGAACAGGC CAATAAGGAG GGAGCaGTGC GGGGTTTAAA TCTGAGGCTA GGCTGGCTCT 6 0 

TCTCGGCGTG CTGOSGCGGA ACGGCTGTTG GTTTCTGCTG GTTGTAGGTC CTTGGCTGGT 120 

CGGGCCTCCG GTGTTCTGCT TCTCCCCGCT GAGCTGCTGC CTGGTGAAGA GGAAGCC»TG 180 

GCGCTCCGAG TCACCAGGAA CTCGAAAATT AATGCTGRAA ATAAGGCGAA GATCAAC»TG 240 

GCAGGCGCAA AGCGOSTTCC TACGGCCCCT GCTGCAACCT CCAAGCCCGG ACTGAGGCCA 300 

AGAACAGCTC TTGGGGACAT TGOTAACAAA GTCAGTGAAC AACTGCAGGC CAAAATGCCT 360 

ATGAAGAAGG AAGCAAAACC TTCAGCTACT GGAAAAGTCA TTGATAAAAA ACIACCAAAA 420 

A AGGTACCTAT GCTGGTGCCa GTGCCAGTGT CTGAGCCAGT GCCAGAGCCA 480 

C CAGAACCTGA GCCTGTTAAA GAAGAAAAAC TTTCGCCTGA GCCTATTTTG 540 

G CCTCTCCAAG CCCAATGGAA ACaTCTGGAT GTGCCCCTGC AGAAGAAGAC 600 

0 CTTTCTCTGA TGTAATTCTT GCAGTAAATG ATGTGGATGC AGAAGATGGA 660 

GCTGATCCAA ACCTTTGTAG TGAATATGTG AAAGATATTT ATGCTTATCT GAGACAACTT 720 

GAG3AAGAGC AAGCAGTCAG ACCAAAATAC CTACTGGGTC GGGAAGTCAC TGGAAACATG 780 

AGAGCCATCC TAATTGACTG GCTAGTACAG GTTCAAATGA AATTCAGGTT GTTGCAGGAG 840 

ACCATGTACA TGACTGTCTC CATTATTGAT CGGTTCATGC AGAATAATTG TGTGCCCAAG 900 

AAGATGCTGC AGCTGGTTGG TGTCACTGCC ATGTTTATTG CAAGCAAATA TGAAGAAATG 960 

TACCCTCCAG AAATTGGTGA CTTTGCTTTT GTGACTGACA ACACTTATAC TAAGCACCAA 1020 

ATCAGACAGA TGGAAATGAA GATTCTAAGA GCTTTAAACT TTGGTCTGGG TCGGCCTCTA 1080 

CCTTTGCACT TCCTTCGGAG AGCATCTAAG ATTGGAGAGG TTGATGTCGA GCAACATACT 1140 

TTGGCCAAAT ACCTGATGGA ACTAACTATG TTGGACTATG ACATGGTGCA CTTTCCTCCT 1200 

TCTCAAATTG CAGCAGGAGC TTTTTGCTTA GCACTGAAAA TTCTGGATAA IGGTGAATGG 1260 

ACACCAACTC TACAACATTA CCTGTCATAT ACTGAAGAAT CTCTTCTTCC AGTTATGCAG 1320 

CACCTGGCTA AGAATGTAGT CATGGTAAAT CAAGGACTTA CAAMCACAT GACTGTCAAG 1380 

AACAAGTATG CCACATCGAA GCATGCTAAG ATCAGCACTC TACCACAGCT GAATTCTGCA 1440 

CTAGTTCAAG ATTTAGCCAA GGCTGTGGCA AAGGTGTAAC TTGTAAACTT GAGTTGGAGT 1500 

ACTATATTTA CAAATAAAAT TGGCACCATG TGCCATCTGT AAAAAAAAAA AAAAAAAAAA 1560 



MALEVTRNSK INAENKAKIN MAGAKRVETA EAATSKPGLR PRTALGDIGN KVSEQLQAKM 60 
PMKKEAKPSA TGKVIDKKLP KPLEKVPMLV PVPVSEPVPE PEPEPBPEPV KEBKLSPEPI 12 0 
LVDTASPSPM ETSGCAPAEE DLCQAFSDVI LAVNDVDAED GADPNLCSEY VKDIYAYLRQ 180 
LEEEQAVRPK YLLGREVTGN MRAILIDWLV QVQMKPRLLQ ETMYMTVSII DEFMQNNCVP 240 
KKMLQLVGVT AMPIASKYEE MYPPEIGDFA PVTDNTYTKH QIRQMEMKIL EALNFGLGRP 300 
LPLHPLRRAS KIGEVDVEQH TLAKYLMELT MLDYDMVHFP PSQIAAGAFC LALKILDNGE 360 
— KISTLPQLNS 420 



1 #: Eos sequence 



TGAAAGATAC 
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r TGAGATAAAC AAACTTCCAG T 

A AATCO^AGftA ATGCftGCftCT TCTTGGGTCT GAAAGTGACC 
GGGCAACTGG ACACATCTAC CCTGGAGATG ATGCACGCAC C 
GTCCATCATT TCAGGGAAAT GCCRGGGOSG CCCGTATGGA G 
AGAATCAATA ATTACACACC TGACATGAAC CGTGAGGATG T 
GCTTTCCAAG TATGGAGTAA TGTTACCCCC TTGAAATTCA GCAAGATTAA C 
GCTGACATTT TGGTGGTTTT TGCCCGTGGA G 
AAAGGTGGAA TCCTAGCCCA TGCTTTTGGA CCTGGATCTG G 
TTCGATGAGG ACGAATTCTG GACTACACAT T 
GTTCACGAGA TTGGCCATTC CTTAGGTCTT GGCCATTCTA G 
TTCCCCACCT ACAAATATGT TGACATGAAC ACATTTCGCC TCTCTGCTGA T 

r CCCTGTATGG AGACCCAAAA GAGAACCAAC GCTTGCCAAA TCCTGACAAT 

h CCCCAATTTG A 
r TCTTCAAAGA CAGGTTCTTC TGGCTGAAGG T 

CTTATGGCCA ACCTTGCCAT CTGGCATTGA AGCTGCTTAT 1020 
GAAATTGAAG CCAGAAATCA AGTTTTTCTT TTTAAAGATG ACAAATACTG GTTAATTAGC 1080 
: CAGAGCCAAA TTATCCCS^G AGCATACATT CTTTTGGTTT TCCTAACTTT 1140 
\ TTGATGCAGC TGTTTTTAAC CCACGTTTTT ATAGGACCTA CTTCTTTGTA 1200 
GATAACCAGT ATTGGAGGTA TGATGAAAGG AGACAGATGA TGGACCCTGG TTATCCCAAA 1260 
CTGATTACCA AGAACTTCCA AGGAATOGGG CCTAAAATTG ATGCAGTCTT CTACTCTAAA 
AACAAATACT ACTATTTCTT CCAAGGATCT AACCAATTTG A - - — 

CGTATCACCA AAAC3«:TGAA AAGCAATAGC TGGTTTGGTT G 
TGGTTTTTGT TAGTTCACTT CAGCTTAATA AGTATTTATT G 
GTGTACCACT ACTTAGAGAT ATGTATCATA AAAATAAAAT CTGTAAACCA T 
TTATATAAAA TACATAATAT TTTTCAATTT TGAAAACTCT AATIGTCCAT TCTTGCTTGA 1620 
CTCTACTATT AAGTTTGAAA ATAGTTACCT TCAAAGCAAG ATAATTCTAT TTGAAGCATG 1680 
TGCTTCCTAA CATCCTTGGA CTGAGAAATT ATACTTACTT CTGGCATAAC 1740 
TATATATATT TTGGCTCAAA li 



I I I I I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNHV LPGERYLEKF YGLEINKLPV TKMKYSGMLM 
KEKIQEMQHF LGLKVTGQLD TSTLBMMHAP RCGVPDVHHF REMPGGFVJIR KHYITYRINN 
YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KIMTGMADIL WFARGAHGD FHAFDGKGGI 
LAHAFGPGSG IGGDAHPDED EPWTTHSGGT NLPLTAVHEI GHSLGLGHSS DPKAVMPPTY 
KYVDINTFRI. SADDIRGIQS LYGDPKBNQR LPNPDNSEPA LCDPHLSFDA VTTVGNKIPP 
FKDRFFWLKV SERPKTSVNL ISSLWETLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 
EPNYPKSIHS FGFPNFVKKl DAAVFNPRPY RTYPFVDHQY WRYDERRQMM D 
( YFFQGSNQPB YDFIiLQRITK TLKSHSWFGC 



3 GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 
GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 
GGGCTGGCTG GTCACCftGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 
GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGOSGGGfi GCAGCATGGA 
GCCTTCGGCT GACTGGCTGG CCACGGCCGC GGCCCGGGGT CGGGTAGAGG AGGTGCGGGC 
GCTGCTGGAG GCGGGGGCGC TGCCCAACGC ACCGAATAGT TACGGTCGGA GGCCGATCCA 
GQTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG GCGCGGAGCC 
CAACTGCGCC GACCCCGCCA CTCTCACCCG ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 
CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 
GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 
GCGCGCGGCT GOSGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 
TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 
QATCaTCAGT CACCGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA C 
TTTOSTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG TCC" ™ " 
ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTT 
AAATGTAAAA AAGAAAAACA CCGCTTCIGC CTTTTCACTG 
AGCACTCACG CCCTAAGCGC ACaTTCATGT GGGCATTTCT T 
GAAGCTGTCG ACTTCXTGAC AAGCATTTTG TGAACTAGGG A 
CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 
ATTTTCATTC ATTCACTC 

Seq ID NOi 127 Protein sequence 
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TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 

CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 

TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC CTGGACAOGC TGGTGGTGCT 

GCACCGGGCC G 



CTACAGGGCC ACAACTGCCC CCGCCACAAC CCACCCOSCT TTCGTAGTTT TCATTTAGAA 
AATAGAGCTT TTAAAAATGT CCTGCCTTTT RACGTAGATA TAAGCCTTCC CCCACTACCG 
TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 
CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTCSA GCftCTCAOSC CCTAAGCGCA 
CATTCATOIG GGCATTTCTT GCGAGCCTCG CAGCCTCOSG AAGCTGTOGA CTTCATGACA 
AGCATTTTGT GftACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC 



ARLDVKDAWQ SO 



Seq ID NOt 130 DNA Bsqusnce 

Nucleic Acid Accession #i NM_058197.1 

Coding sequence: 272.. 646 

1 11 21 31 41 51 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 
TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGOSG TCCCTCCAGA 
GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCOSCCAGC ACCGGAGGAA GftAAGAGGAG 
GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCMTGCG CTOSGOGGCT GCGGAGAGGG 
GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCOS GCGGCGGGGA GCAGCATGGA 
GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCAOGG CCGCGGCCCG 
GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 
TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 
CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 
COSGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 
3 CTTGGGAAAC CAAGGAAGAG GAATGA( 
A GAAGATCTGA AGGGGGGAAC ATATTTGTAT T 
TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC G 



3 CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 
^ TTCATGTGGG CATTTCTTGC GAGCCTOSCA GCCTCCGGAR 
TTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 
CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 



MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALL EAGALPNAPN SXGRRPIQVG 
RRSAAGAGDG GRLWRTKPAG ELESGSASIL RKKGRLPGEF SEGVCMHRPP PGDALGAWET 
KEBE 
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GLAVWIPPHI HNKTSPIPXL KNIWADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 

FYANMYTSIV PLGLISIDRV LKWKPPGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS EYIHKSSRQF 240 

ISQSSRKEKH HQSIRWVAV PPTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 
PLSAQJVCLD PIIYPPMC31S PSRRLPKKSH lETRSESIRS LQSVRRSEVR lYYDXTDV 



Seg ID HO: 13 S DMA sequence 
Nucleic Acid Accession #i NM_ 
Ccxiing sequences 1..1662 
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ATGCCGGTGC AGCTGACGAC AGCCCTGOST GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 
GTGCTGGGTG GCaTCCTGGC AGCCTATGTG ACGGGCTACC AGTTCRTCCA CACGGAAAAG 
CACTACCTGT CCTTCGGCCT GTAOSGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 
CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 



C GGTGGACTAC ATCCAGGTGT GCGACTCTGR CACTGTGCTG 
r GCACCATCGA GATGCTTOa GTCCTGGAGG AGGATCCCCR AGTAGGGGGA 
GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TAOSACrCAT GGATTTCTTT CCTGAGCAGC 
GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT T 



\ CCGGCACCTC 

TACTTCCCGC AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 
3 CAGAGATGAT C 



1^ CACAGCTTAT 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 
\ TATCTGGCCA TCATOSCCCG G 



WDGNRQEDA 7MLDIFHEVL GGTEQAGFFV WRSNFHEAGE 6ETEASLQEG MORVEDWHA 

STPSCIMQKW GGKREVMYTA PKALGDSVDY IQVCDSDTVL DEACTIEMLR VLEEDPQVGG 

VGGDVQILNK YDSWISFLSS WYWMAPNVE EACQSYPGCV QCISGPLGMY HNSLLQQPLE 

DWYHQKFLGS KCSFGDDRHL THRVLSLGYE TKYTAESKCL TETPTKYLRW LNQQTRWSKS 

YPREWLYNSL WPHKHHLWMT YESWTGPFP PPLIATVIQL PYRGRIWNIL LPLLTVQLVG 

IIKATYACPL RGNAEMIPMS LYSLLYMSSL LPAKIPAIAT INKSCWGTSG RKTIWNPIG 

LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML 

KKPEQYSLAP AEV 



3oeaBion"#: NM 001327.1 
2: 89. .631 

21 31 41 51 

C GCTGTGTGTA CCGAGAATAC GAGAATACCT aSTGGGCCCT GACCTTCTCT 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 

TGQCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG C 

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG A 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG T 



CTGTCTCCAG 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT 
GCCTCCTCCC CTAGGGAATG GTCCCAGCTiC GAGTGGCCftG TTCATTGTGG GGGCCTGRTT 
65 GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 

Seq ID NO! 139 Protein Sequence 
Protein Accession H, NP_001318.1 



MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGF( 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAK: 
VLLKBFTVSG HILTIRLTAA DHRQLQLSIS 
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75 



TCTCTGAGflG CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 
TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 
TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCRGA 



CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 
10 GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT QCAGCTCTCX: 
ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 
GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCX3CTAAG CCCAGCCTGG CGCCCCTTCC 
TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACX3A GTGGOCAGTT CATTGTGGGG 
GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAftAG 



MOAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 
PRGEHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 
FI>PVFLAQAP SGQRR 

Seq ID NOt 142 DMA Sequence 

Nucleic Acid Acceeeion #> NM_130467.2 

coding seisuence: 246.. 638 



AGGCAGAGCT CTGCAAGGAG AGGTTGTGTC TTCGTTCTTT CCGCCATCTT CGTTCTTTCC 60 

AACATCTTCG TTCTTTCTCA CTGACCGAGA CTCAGCCGGT AGGTCTGCAG AGTGGTCTTC 120 

35 CTGGTAATTT AGTTGTGAGT GAATGTGTGG AGGAGCCAGC GGGCTTAGGA CAGGTCCTGT 180 

GGCRCAGTCC GTGGCTTTGA GGGftAAAGGG CCTCGCGGTG GTCCTCCGCC TTCCCCCAGG 240 

TCGTGATGCA GGCGCCATGG GCCGGTAATC GTGGCTGGGC TGGAACGAGG GAGGAAGTGA 300 

GAGATATGAG TGAGCATGTA ACAAGATCCC AATCCTCAGA AAGAGGAAAT GACCAAGAGT 360 

CTTCCCAGCC AGTTGGACCT GTGATTGTCC AGCAGCCCAC TGAGGAAAAA CGTCAAGAAG 420 

40 AGGRACCACC AACTGATAAT CAGGGTATTG CACCTAGTGG GGAGATCRAA AATGAAGGAG 480 

CACCTGCTGT TCRAGGGACT GATGTGGftAG CTTTTCAACA GGAACTGGCT CTGCTTAAGA 540 

TAGAGGATGC ACCTGGAGAT GGTCCTGATG TCAGGGAGGG GACTCTGCCC ACTTTTGATC 600 

CCACTAAAGT GCTGGAAGCA GGTGAAGGGC AACTATAGGT TTAAACCAAG ACAAATGAAG 660 
ACTGAAACXA AGftATATTGT TCTIATGCTG GAAATTTGAC TGCTAACATT C 
AGTTTTACAG TTTTC 

Seq ID NO I 143 Protein Sequence 



Coding s 



3 TGCTGTGTGG TTCCTGCCGT CCGGACl 
TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 
r ATGTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCC — 



GCAGCTGCTC A3GAGGC 

GGGCAGGAGG T3GACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGR AGGTGAAAAG 420 
CAATCACAGT GTTAvW.GAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGTTGGAAAT 480 
TTGTTCATTA AAATTCTCCC AATAAAGCT 



MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QFSDBVEPAT PEEGEPATQR QDPAAAQEGE 
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEKQSQC 
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Seq ID NO: 146 DMA 
Nucleic Acid Access 
Coding sequence; 9C 



ACAGCGGAGC GCAGAQTGAG AACCACCAAC CGAGGOSCCG GGCAGCGACC CCTGCAGOGG 
AGACAQAGAC TGAGCX3GCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 
C GCAGCCCGGG CCACCTCCAG G 




A ATACAGTACT G' 
TCTCTGGAGC CCCAGCACCC T 
AATTCTGCCA GGATTGTGCT T 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 
G GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT O 

C AGCTGCAAGC CATGTCCCTG TCATAACCGG TTCAGCTGCT 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

A GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

G TCAGOCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCOSGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGQGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGRAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGICCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAQATGG ATCAGTTTAT GCAGCAGCTT CAGAGRATGG 1980 

TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGCTGAG CAGGCCCTTC AGGACaTTCT GAGAGATGCC CAGATTTCAG 210 0 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGC(aAGGT GAGGAGCCAA GAGAACAGCT 2160 

45 ACCAGAGCCG CCTGGRTGAC CTCAAGATGA CTGTGGAAAG AGITCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGASTTOSG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCyiGAAAGC CACGTTGAGT 2400 

^ CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

50 CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 252 0 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAQGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCOSCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAA(3VAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

55 AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGO GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

G TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

C TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

G GGGAGCGCTG CTGCTGATGC ACAGAGGGCA RAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGMCTTGG 3180 



GGAAGTGGAA 
TGGATGCAGT flCAGATGGTG ATTACAGAAG 
65 CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 
r CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 
\ CAAGCATAGA TGGGATTCTG GCTGATGTGA 
AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 
r AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 
3 GGATGGGGAC ATTTGAACAT GTTTAATGGG 
r CCTGATCCCA T 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGtK A 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 
ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 
ATAQTCAACT TATTCTTTQA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 
ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 
ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 
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ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC AITCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 432 0 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGC3WA GCAAATGTTG GGAAAGTATT TACTTTTTOS 4440 



4E60 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 



TCTCTGCCCT Cf 

CATATGTTGC 



TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 



NGKSRQCIFD RELHEQTGNG F 
GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPG 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR S 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILE GAGLRITAPL 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFBYRRLL RNLTALRIRA TYGEYSTGYI 

30 DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 

GGACDPDTGD CYSGDBNPDl ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEW 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGH CDRLTGRCLK 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 

NCEHGAPSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 

35 QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVEDTH 

RLITQMQLSL AESEASLQNT HIPASDHYVG PNGPKSLAQB ATRLAESHVE SASNMEQLTR 

ETEDYSKQAL SLVRKALHBG VGSGSGSPDG AWQGLVEKL EKTKSLAQQIi TREATQAEIE 

ADRSYQHSLR LLDSVSRLQG VSDQSPQVEE AKHIKQKADS LSTLVTRHMD EFKRTQKNLG 

NWKEBAQQLL QNQKSaRBKS DQLLSRANLA KSHAQEALSM GNATPYBVES ILKNLREPDL 

40 QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKHG AGEALEISSE 

lEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKBLEFDTH MDAVQMVITE 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLBQKLSRA KTQINSQLRP 

MMSELEERAR CX3RGHLHLLE T5IDGILADV KNLENIRDNL PPGCYNTQAL EQQ 



50 I 1 I 1 1 1 

GGCACGAGGG AGAAGCGAGG TTCTCGTTCT GAGGGACAGG CTTGAGATCG GCTGRAGAGA 
GCGGGCCCAG GCTCTGTGAG GAGGCAAGGG AGGTGAGAAC CTTGCTCTCA GAGGGTGACT 
CAAGTCAACA CAGGGAACCC CTCTTTTCTA CAGACACAGT GGGTCGCAGG ATCTGACAAQ 

55 CAGCACTGAA GGAGAAGACC TGCCTGTGGG TCCCCATCGC CCAAGTCCTG CCCACACTCC 

a GTCATCATGC CICGAGCTCC AAAGCGTCAG C 



TGGCTGTGGA 



T TAGATCAATC 

AAAAGGAGGA GAGTCCAAGC ACCCTACAGG TCCTGCCAGfl 

3 ACIGATTTGG TGCAGTTTCT GCTCTTCAAG 
G AGAGTGTCAT 



65 TCCCTTTGTT GTTTAGTGAA GCCTCCGAGT GCATGCTGCT GGTCTTTGGC ATTGATGTAA 

AGGAAGTGGA TCCCACTGGC CACTCCTTTG TCCTTGTCAC C 



: CTGAGGAGGT C 
TGATGGGGCT GTATGATGGG ATGGAGCACC TCATTT^ 

a AGTACCGGCA GGTGCCTGGC A' 
3 CTCATGCTGA A 

TGAAATTTTT GGCCAAGGTA AATGGGAGTG ATCCAAGATC CTTCCCACTG T 

3 AGGACAGAAT TGCCACCACA GATGATACTA 



AAGTCAAATA CCACATGATT 
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MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA P 

-T PNPPQSAQIA C 

D LVQPLLFKYQ M 

ECMLLVPGID VKEVDPTGHS FVLVTSLGLT V 

L, NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 

3 SDPRSFPLWY EEALKDEEER AQDRIATTDD TTflMASASSS 
ATGSPSYPE 



CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC C 
CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG 

: CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 



TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT G 

ACAACSCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 

TGAGCCTCCT GGCCGTCftTC TTAGCCCCCA GCCTGTGACC ITCCCCCCAG GGAAGGCCCC 

TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 

GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 

CATGGAATGC TGftTGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 

ACAGAGGATG CAGCCCCCAG CTGCftTGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 

GATTTCaCAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 
TAAATGATTT AftACC 

Seq ID NO: 151 Protein Sequence 



MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSWCPA SSRFCKTTNT VEPLRGNLVK 
KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKIiHNAA PTRTALAHSA LSLGLALSLL 
AVILAPSL 



I 1 I I I I 

GGCCCCTTGT CTGCAGAGAT GGCTCCCAAT GCTTCCTGCC TCTGTGTGCT^ TGTCCGTTCC 

GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCCATATG ACAGCGTGAA AAAAATCAAA 
GAACATGTCC GGTCTAAGAC CAAGGTTCCT GTGCKGGACC AGGTTCTTTT GCTGGGCTCC 
AAGATCTTAA AGCCACGGAG AAGCCTCTCA TCTTATGGCA TTGACAAAGA GAAGACCATC 
CACCTTACCC TGAAAGTGGT GAAGCCCAGT GATGAGGAGC TGCCCTTGTT TCTTGTGGAG 
TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGAA GGTCCAGCTC AGTGGCACAA 
GTGAAAGCAA TGATCGAGAC TAAGACGGGT ATAATCCCIG AGACCCAGAT TGTGACTTGC 
AATGGAAAGA GACTGGAAGA TGGGRAGATG ATGGCAGATT ACGGCATCAG AAAGGGCAAC 
TTACTCTTCC TGGCATCTTA TTGTATTGGA GGGTGACCAC CCTGGGGATG GGGTGTTOGC 
AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GAACACATCT TCTGATGATT 
TCCCAAAATT AATGAGAATG AGATGAGTAG AGTAAGATTT CaGGTGGGATG QGTAGGATGA 
AGTATATTGC CCAACTCTAT GTTTCTTTGA TTCTAACACA ATTAATTAAG TGACATGATT 
TTTACTAATG TATTACTGAG ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 



MAPHASCLCV HVRSEEWDLM TFDANPYDSV KKIKEITi/RSK TI«'PVQDQVL LLGSKILKPR 
RSLSSYGIDK EKTIHLTLKV VKPSDEELPL FLVESGDEAK RHLLQVRRSS S 
TKTGIIPETQ I 



GTGGGTCACG TGAACCACTT TTCGCGCGAA A 
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TCGTGGTACT GCTATGGCGG AATCATCGGA ATCCTTCACC ATGGCATCCA GCCCGGCCCA 

GCGTCGGCGA GGCAATGATC CTCTCACCTC CAGCCCTGGC CGAAGCTCCC GGCGTACTGA 
TGCCCTCACC TCCAGCCCTG GCCGTGACCT TCCACCATTT GAGGATGAGT CCGAGGGGCT 



CATGCGGCAG CGTGACCGGG AGGCTGGCCG GGGCCTGGGC CGCATGCGCC GTGGGCTCCT 



G GACGGCGAGG AGGACGAGGA GATGATCGAG AGCATCGAGA ACCTGGAGGA 
TCTCAAAGGC CACTCTGTGC GCGAGTGGGT GAGCATGGCG GGCCCCCGGC TGGAGATCCA 
A CGTCGACAGC CACGGCCACA A 



GAGGCACCGG CGGAGCTGCT 
GGTACTGGCC A 



CCCCAGCTCA GCATGGTCAA GTACAACTGC AACAAGTGCA ATTTCGTCCT 



GTCGGCCGGC CCCTTTGAGG TCAACATGGA GGAGACCATC TATCAGAACT ACCAGCGTAT 1200 

CCGAATCCAG GAGAGTCCAG GCARAGTGGC GGCTGGCCGG CTGCCCCGCT CCftAGGACGC 1260 

CATTCTCCTC GCAGATCTGG TGGACAGCTG CAAGCCAGGA GACGAGATAG AGCTGACTGG 1320 

CATCTATCAC AACAACTATG ATGGCTCCCT CAACACTGCC AATGGCTTCC CTGTCITTGC 1380 

CACTGTCATC CTAGCCAACC ACGTGGCCAA GAAGGACAAC AAGGTTGCTG TAGGGGAACT 1440 

GACCGATGAA GATGTGAAGA TGATCACTAG CCTCTCCAAG GATCAGCAGA TCGGAGAGAA 1500 

GATCTTTGCC AGCATTGCTC CTTCCATCTA TGGTCATGAA GACATCAAGA GAGGCCTGGC 1560 

TCTGGCCCTG TTCGGAGGGG AGCCCAASAA CCCAGGTGGC AAfiCaCAAGG TACGTGGTGA 1620 

TATCAACGTG CTCTTGTGCG GAGACCCTGG CACAGCGAAG TCGCAGTTTC TCAAGTATAT 1680 

TGAGAAAGTG TCCAGCCGAG CCATCTTCAC CACTGGCCAG GGGGCGTCGG CTGTGGGCCT 1740 

CACGGCGTAT GTCCAGCGGC ACCCTGTCAG CAGGGAGTGG ACCTTGGAGG CTGGGGCCCT 1800 

GGTTCTGGCT GACOSAGGAG TGTGTCTCAT TGATGAATTT GACAAGATGA ATGACCAGGA 1860 . 

CAGAACCAGC ATCCATGAGG CCATGGAGCA ACAGAGCATC TCCATCTCBA AGGCTGGCAT 192 0 

CGTCACCTCC CTGCAGGCTC GCTGCACGGT CATTGCTGCC GCCAACCCCA TAGGAGGGCG 1980 

CTACGACCCC TCGCTGACTT TCTCTGAGAA CGTGGACCTC ACAGAGCCCA TCATCTCACG 2040 

CTTTGACATC CTGTGTGTGG TGAGGGACAC CGTGGACCCA GTCCAGGACG AGATGCTGGC 2100 

CCGCTTCGTG GTGGGCAGCC ACGTCAGACA CCACCCCAGC AACAAGGAGG AGGAGGGGCT 2160 

GGCCAATGGC AGCGCTGCTG AGCCCGCCAT GCCCAACACG TATGGCGTGG AGCCCCTGCC 2220 

CCAGGAGGTC CTGAAGAAGT ACATCATCTA CGCCRAGGAG AGGGTCCACC CGAAGCTCAA 22 80 

CCAGATGGAC CAGGACAAGG TGGCCAAGAT GTACACTCAC CTGAGGA.!y\G .nATCTATGGC 2340 

GACAGGCAGC ATCCCCATTA CGGTGCGGCA CATCGAGTCC ATGATCCGCA TGGCGGAGGC 2400 

CCACGCGCGC ATCCATCTGC GGGACTATGT ■ GATCGAAGAC GACGTCAACA TGGCCATCCG 2460 

CGTGATGCTG GAGAGCTTCA TAGACACACA GAAGTTCAGC GTCATGCGCA GCATGCGCAA 2520 

GACTTTTGCC CGCTACCTTT CATTCCGGCG TGACAACAAT GAGCTGTTGC TCTTCATACT 2S80 

GAAGCAGTTA GTGGCAGAGC AGGTGACATA TCAGCGCAAC CGCTTTGGGG CCCAGCAGGA 2640 

CACTATTGAG GTCCCTGAGA AGGACTTGGT GGATAAGGCT CGTCAGATCA ACATCCACAA 2700 
CCTCTCTGCA TTTTATGACA GTGAGCTCTT C 



G GGGTGGTCAG TGCCCTCTGT GCTTTATGGA C 
TGAACTCGGG QTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 
TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT C 



TTGGATCAGA G 
ATGGATGTCA GGAGAGCTGC T 
TGCCTTTGGC C 

TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 3300 
GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCaCAGTTA TCAGCTGCCA 3360 
TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGOTT TCTGTAGTTT 3420 

TAATTTTTAA 



3 PRLEIHHRPK NPLRTHVDSH GHNVFKERIS DMCaCEMRESL WNVEDLAAR 240 

EHVIAYPLPE APAELLQIPD EAALEWLAM YPIODRITNH IHVRISHLPL VEELRSLRQL 300 

HLHQLIRTSG WTSCTGVLP QI.SHVKYNCN KCNPVLGPPC QSQNQEVKPG SCPECQSAGP 360 

PEVNMEETIY QNYQRIRIQE SPGKVARGRL PRSKDAILIA DLVDSCKPGD EIELTGIYHN 420 

NYDGSLNTAN GFPVFATVIL ANHVAKKDNK VAVGELTDED VKMITSLSKD QQIGEKIPAS 480 

lAPSIYGHED IKRGLALALF GGEPKNPGGK HKVRGDINVL LCGDPGTAKS QFLKYIEKVS 540 

SEAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLRD RGVCLIDEFD KMNDQDRTSI 600 

HEAMEQQSIS ISKAGIVTSL QARCTVIfiAA NPIGGRYDPS LTFSEHVDLT EPIISRPDIL 660 

CWRDTVDPV QDEMLAEPW GSHVRHHPSH KEEBGLAHGS AAEPAMPNTY GVEPLPQEVL 720 

KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780 

HLRDYVIEDD VNMAIRVMIiE SFIDTQKFSV MRSMRKTPAR YLSFRRDNNE LLLPILKQLV 840 

AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QIHIHNLSAF YDSELPRMNK FSHDLKRKMI 900 
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LQQF 



c acacAAccia 

C GGCCATGCCT TCCCGGGCTG AGGACTATGA 
GSAGGAAGAG 

GGAAAGAACI TGACTATGGC TCCATGACAG AAGCTGAGAA 



60 



: AATACTTAGA 
A GTTGACTCTG GCCCTGAAGG AATGCCACAG A 

FTTTCCT GGATGGCAAG CAAAACGTCA 
AGCTTGGAC3A CTITGGGCTA GCTAGAATAT TAAACCfl 
TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA T 
AATCAGATAT CTGGTCRTTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA A 
TTACAGCTTT TAGCCAGASA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 
TTCCATACOS TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 
ATTACCATOS ACCTTCTGTT GAAGASATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 
CAGACGAGCa AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGRGCA GASAGAACAG GAGCTTTGTG 
TTCGTGAGSG ACTAGCAGAG GACABACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 
GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 
G AAAGTTCATT TCAGTGGGGA AAGTASAGAG AACATCATGA 
A TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 
TGAAAAGCAG ACAGATCCTG GGCATGa3CT AGCCAGGTAG AGAGACACAG AGCTGTGTAC 
AGGATGTAAT ATIACCAACC TTTAAAGACT GATATTCAAA TGCIGTAGTG TTGAATACTT 
GGCCCCATGA GCCATGCCTT TCTOIATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 
TTCTTCAGCA ACTATTiHAC AAAATGTTCA CATTTAATTT TTCITTCTTC TTTTAAGAAC 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 
GAACaTGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 
TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACSiAT 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTIATAAAT ATTCCATTGC 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 
TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCftTTTA CCTACTTCTT TCCCACTAAA 
TAAAAGAATT CTTCAGTTA 



I I I 

3 RCQKIRRKSD GKILVWKELD YGSMTEAEKQ MLVSEVNLLR 
rVMEYCE QGDLASVITK GTKERQYIjDE EFVLRVMTQL 
R SDGGHTVLHR DLKPANVFLD GKQNVKLGDF GLARILNHDT SFAKTFVGTP 
YYMSEEQMNR MSYNEKSDIW SLGCLLYELC ALMPPFTAFS QKELAGKIRE GKFEEIPYRY 
55 SDELNEIITR MLNLKDYHRP SVEEILENPL lADLVADEQR RNLERRGRQL GEEEKSQDSS 
PVLSELKLKE IQLQEREEAL KAREERLEQK EQELCVRERL AEDKLARAEN LLKNYSLLKE 

RKPLSLASNP ELLNLPSSVI KKKVHFSGES KENIMRSEMS ESQLTSKSKC K. 

QLRAQALSDl EKNYQLKSRQ ILGMR 



I I I I 

TGAGTTTGCC CCCTTACCCC CATCCCAGTG AATATTTGCA A 
ATTGTCACAC CTGGGTQGGG AACATGCTAC TGGCATCTAA T 
TGCTAAACAT CTTTCAACGC ACAGGACAGA GCCCCACAAA AGAGAATTAT C 
ATGTCCATAA CACTGCTGTT GAGAAAACCT ACCGCAGGAT CTTACTGGGC TTCATAGGTA 
AGCriGCCTT TGTTCTGGCT TCTGTAGATA TATAAAATAA AGACACTGCC CAGTCCCTCC 
CTCAACGTCC CGAGCCMGG CTCAAGGCRA TTCCAATAAC AGTAGAATGA ACACTAAATA 
TTGATTTCAA AATCTCAGCA ACTAGAAGAA TGACCAACCA TCCTGGTTGG CCTGGGACTG 
TCCTAGTTTT AGCATTGAAA GTTTCAGGTT CCAGGAAAGC CCTCAGGCCT GGGCTGCTGG 
TCACCCTAGC AGCTGAGGGA CTCITCAATA CAGAAITAGT CTTTGTGCAC TGGAGATGAA 
TATACTTTAA TTTGTAACAT GTGAAAACAT CTATAAACAT CTACTGAAGC CTGTTCTGTC 
TGCACCGACA TTTTCATTGA GTAOSGATTC TTCCTACCAG ATACAGCTGC TCTACAACTT 
TCGAGGGCTG GTATAAAACT AGCITTTACC TATTTITAAA AAITACATGA ATAGTAAAAA 
CTTGGATTAA CCCAGTATTC GGGTATTTTC AATTTCCTTG GGAGCTTAGA GGACGGACAA 
A TCAAATATAY GCTATTGTTT ACATATGAAG ATAACCACAT 



284 



wo 03/003906 



PCT/LS02/21338 



AATACTATAR 



Seq ID NOi 159 DNA S 
Nucleic Acid Access!' 
Coding sequence: 264 



I I I I I 

CCCTGCTCCA GTCACACCCG GAAGCTGACT GGTCCACGCA " 
3 ACTAATTTTC CTTAAAATTT AGACTTGCAC 
h TACATCAAGT 
C CAGGGTGGTT 
3 CAAGGAGCTC ATCATGCTGA CAGAAGTCAT GGAGGTCTGG 
G CCTGCTTCCT 



3 CCCTGGCCAC CATCATCCAC CTGCTGTCAA AGAGATGAAG G 

h CATCCCAATG TCTGATTCCC TTTACAGGCA TGACAGC 

ATAGCTTGGA TAGCTCCTGC AGTTCGCCTC CTGCCTGCCA GGCCACS 

ACACACAAGT CGTCTTTTCT GACCCTGGAG AACTAAAAAA TGACTCCCOS CTGGACTATG 

AGAACATAAA GGftflATCaCA GRTTATGTCA ATOTCAATCC AGAAAGACAC AAGCCCAGTT 

TCTGGTATTT TGTCAACCCT GCTCTGTCTG AGCCAGCGGA ATATGATCAR GTGGCCATGT 

GAATTCCAAA TATTTTTAAT GGGGTCCAGT TCTCTATGGA TTCTTACATT TAATTTGTAG 

GGAAATGCCa TTTTTCCCCC TTAAAC3U«3a CATGGGGCTC ACS^OICTAT GGAGACAGGC 

CAAAAAGAAT GTGGAGAAGA AAACTGATAA ATACACAGAG GTCCTCAAGA CCCaTGGACT 

CCTGGTCTGT ACCCAAAAAA GCTGTTCGTT CCTCAAAAAC AAAAACAAGG CTTGGCTGGG 

AAAACAGGCC AATGCCCCGG CAAGAAAGGT TGBGATCaGA TGTTAGGAAG AACTTTCAGG 

TAAAGTATGA GAACTATGGA GTCCATCAGC AGAGATAGTA GTGAAGTCTC TCCCCAGGGA 

AAATTTTAAA AAGGTTGAAT C3M3CTGTTGI AGAGTTCTAT TTGGCAATCT CATGGTTAAA 

TGACTTCCCT TTGAGCTCTT TAATTATTGG CAATAAACAA CTTCTTTAAA AGTTTTAAAT 

AAAATAGCAA CCACCACCA 



H GLVIAWSLF LQACFLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH 
HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPAOQATED VDinQWFSD 
PGELKNDSPL DYENIKEITD YVNVNPERHK PSFWVFVNPA LSEPAEYDQV AM 




TACCTTGTTT TCTGGACAGT G' 
CTGCGGATCT A 

TCCATCAGCC GCCGGAGGAC ACCCATGAAG CTAATGAAGA O 
GCGTTTGTGG TATGCTGGAC O 

AGGOkQTGTG GCGTGCAGCA TGTGAAAAGG TGGTTCCTGC TGCTGGCGCT GCTCAACTCC 
GTCGTQAACC CCATCATCTA CTCCTACAAG GACGAGGACA TGTATGGCAC CATGAAQAAG 
ATGATCTGCT GCTTCTCTCA GGAGAACCCA GAGAGGCGTC CCTCTCGCAT CCCCTCCaCA 
GTCCTCAGCA GGAGTGACAC AGGCAGCCAG TACATAGAGG ATAGTATTAG CCAAGGTGCA 
GTCTGCAATA AfiAGCACTTC CTAAACTCTG GATGCCTCTC GGCCCACCCA GGTGATGACT 



MHECHYDKHM DFFYBRSNTD TVDDMTGTKL VIVLCVGTFF CLFIFFSNSL VIAAVIKNRK 
PHFPPYYLLA NLAAADPFAG lAYVFLMFNT GPVSKTLTVN RWFLEQGLLD SSLTASUTNL 
bVIAVERHMS IMEMRVHSNL TKKRVTLLIL LVMAIAIFMG AVPTLGWNCL CNISACSSLA 
PIYSRSYLVF WTVSNLMAFL IMVWYLRIY VYVKRKTNVL SPHTSGSISE RRTPMKLMKT 
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CAGTCGCGCG CGGTGCAGTC GGGAGGTGGA GGCACCGGCT GCATTGTTTT CGGGATCGAG 



CCCACGAAGT TTTGAATATT TGTTTTCCTT A 
TGGAAAGAGT TTCCTTTGTA AGTGTTCCTT TATTGAAATC TACAACGAGC AGATATATGA 
TCTACTGGAC TCTGCATCX3G CTGGRCTGTA CTTAAGGGAG CATATCAAGA AGGGAGTCTT 



rCATGCCGTC TTTACAATTA CAATAGAGTC AATGQAQAAA AGTAATGAGA TTGTGAATAT 840 



TGCAGAAGGG ATGAGATT6A AGGAAGCAGG TAACATAAAT CGATCATTGA GCTGCCTGGG 960 

CCAAGTGATT ACftGCACTTG TCX3ACGTGGG TAATGGAAAA CAGAGACATG TTTGCTACAG 1020 

AGACTCCAAA CTTACCTTCT TACTAOSGGA TTCCCTTGGA GGTAATGCCA AAACAGCCAT lOBO 

AATTGCAAAT GTTCATCCTG GATOCAGGTG TTTTGGGGAA ACCCTATCAA CACTTAACTT 1140 

TGCTC3UUM3A GCCAAGCTGA TTAAAAACAA GGCAGTAGTA AATGAAGACA CCCAAGGAAA 1200 

TGTGAGCCAG CTCCAAGCTG AAGTGAAGAG GCTCAARGAA CAACTGGCGG AGCTTGCTTC 1260 

AGGACAGACA CCACCAGAAA GCTTCCTGAC CAGAGACAAA AAGAAGACTA ACTATATGGA 1320 

GTATTTCCAG GAAGCAATGT TATTCTTTAA GAAATCTGAA CAGGAAAAGA AGTCTCTGAT 1380 

AGAAAAAGTT ACCCAATTAG AAGACCTCAC CCTCAAAAAG GAAAAATTTA TTCAATCTAA 1440 



rGCTATGGA 1620 



TAATTCAAAG CAAGAATATG AAGAATTCAA 



AACAAAAGCC TGCAAGCGGC AAGAAGTTTC TCAGCTGAAT A 
TAAGATTATA ACTACACCAA CCAAGGCCTA CCAACTTCAT T 

T ATACACTCAG AATTCTAGCA TATTAGATAA 

TGATATATTA AATGAGCCAG T 



AGAGCATAAA 
AATGCAGGAG 

CTCACAGTTG AATGTCCTTG 




TGAAAGACAC ATGCATGTAC AGCTTCAATT A 
GCTTQAGAGC AAAGCCTGCC T, 

TCAGATTGAC CAACTTTCAA GAAACCTCCA AAACTTCAAA A 
ATCTGATCTG AATAATTTGA TGGAGCTTCT T 



GGCTGTACGT CAGGAGAAAC AGAAAGAGAC GGCCAAGTGT GAGCAGCAGA TGGCAAAAGT 2940 

60 ACAGAAACTA GAAGAGAGCT TGCTTGCTAC TGAAAAAGTG ATCAGTTCCC TGGAAAAGTC 3000 

TAGAGATTCT GATAAGAAAG TTGTAGCTGA CCTCATGftAC CAGATCCAGG AGCTAAGAAC 3060 

ATCGGTCTGT GAGAAAACAG AAACTATAGA CACCCTGAAA CAAGAACTGA AGGACATAAA 3120 

TTOCAAATAC AACTCTQCTT TQaTTQACAG AOAAQAGAGC AQAaTOTTOA TCAAOAAGCA 3180 

GGAAGTGGAT ATTCTGGATC TGAAflGAAAC CCTTAGGCTG AGAATACTTT CTGRGGACAT 3240 

65 AGA6AGGGAT ATQCTCTGT6 AGGACCTGGC TCATGCCACT QAGCAGCTGA ACATGCTCAC 3300 

AOSGGCCTCA AAAAAACACT aaGGGCTaCT QCAGTCTGCC CflOGAAOAAC TGACCAAGAA 3360 

GGAAGCCCTG ATTCAGGAAC TTCAGCACAA GCTAAACCAA AAGAAAGAGG AAGTAGAACA 3420 

GAAGAAGAAT GAATATAACT ICAAAATGAG GCAACTAGAA CaTGTGATGG ATTCTGCTGC 3480 

TGAGGATCCC CAGAGTGCTA AGACACCACC TCACTTTCAA ACACATTTGG CAAAACTCCT 3S40 

70 GGAAACACAA GAACAAGAGA TAGAAGATGG AAGAGCCTCT AAGACTTCTT TGGAACACCT 3600 

TGTAACAAAG CTAAATGAAG ACAGAGAAGT CAAAAATGCT GAAATCCTCA GAATGAAGGA 3660 

GCAGTTGCGT GAAATGGAAA ACCTACGCCT GGAAAGTCAG CAGTTAATAG AGAAAAACTG 3720 

GCTCCTGCAA GGTCAGCTGG ATGATATTAA AAGACAAAAG GAAAACAQTG ATCAGAATCA 3780 

TCCAGATAAT CAACAGCTGA AGAATGAACR AGAAGAAAGT ATCAAAGAAA GACTTGC3UIA 3840 

75 AAGTAAAATA GTTGAAGAAA TGCTGAAAAT GAAAGCAGAC CTAGAAGAAG TCCftAAGTGC 3900 

CCTTTACAAC AAAGAGATGG AATGCCTTAG AATGACTGAT GSAGTCGAAC GAACCCAAAC 3960 

TTTGGAGTCT AAAGCATTCC AGGAAAAAGA ACAACTGAGA TCAAAGCTGG AAGRAATGTA 4020 
TGAAGAAAGA GAGAGAACAT CCCAGGAGAT GGAAATGTTA AGGAAGCAGG 
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h AATGGAAAGT TGGTAGGTCA CCAAAATTTG CATCAGAAGA T 

AGTGCGACTA AAGAAGGAAA ATGTCRGGCT TGCTGAGGAG ACAGAAAAGT TGCGTGCCGA 4200 

\ AGAAAAGAAG TGAATCTTGA GGATTCCGGT CAGCTACCTA 4260 

r TGTTTGAAGA TGTTTCTTCT CTTTTACAAG TAAGACCTAC TCCIGGCCAC 4320 

TTAGGAGAGC TGAATTTATG GACCTTAATT ATTAAATGTT TATAAGGTGG TGGTAACCAC 4380 

CTCAAGTTTC TGATGAACAT TCTGCATCCA TATACACCCT GTGACAGTCA GCAGTCTGCT 4440 

C: CTACTTCAAG GCTTTGAATC AACTTAAGGG AAAACCTTTT GTCTTTGTAA 4500 

\ GGTTTACAGT GGACATTAGC CAGATCATTT TCTTCTTAGA 4560 

A ATCTCCTTTG ATTCTTATGG AAGTTCTAAC AATATATGGT GGTTCCAACA 4620 

C TGACTTAGTA GCAGGTACAA GAAGCAAACT TGTTAATATA 4680 

C TTTAGGTATT TTACTTGAGC ATTTTCCATG ACTGTAAATA 4740 



I I I I I I 

[■lAPGCKTELR SVTNGQSNQP SNEGDAIKVF VRIRPPAERS GSADGEQNLC LSVLSSTSLR 
LHSNPEPKTP TFDHVADVDT TQESVFATVA KSIVESCMS 
GPSESDNPSH NLRGVIPRSF EYLPSLIDRE KEKAGAGKS 
ASAGLYLREH IKKGVPWGA VEQWTSAAE AYQVLSGGK 
TITIESMEKS NEIVNIRTSL LNLVDLAGSE RQKDTHAEGM RLKEAGNINR SLSCLGQVIT 
ALVDVGNGKQ RHVCYRDSKL TFLLRDSLGG MAKTAI HPGSRCFGET LSTLNFAQRA 

KLIKNKAWN EDTQGNVSQL QAEVKRLKEQ LAELASGQTP PESPLTRDKK KTNVMEYFQE 
AMLPPKKSEQ EKKSLIEKVT QLEDLTLKKE KFIQSNKMIV KFREDQIIRl, EKLHKESRGG 
FLPEEQDRLL SELRNEIQTL RBQIEHHPRV AKYAMENHSL REENRRLRLL, EPVKEAQEMD 
AQTIAKLEKA PSEISGMEKS DKNQQGPSPK AQKEPCLFAN TEKLKRQLLQ IQTELNNSKQ 
EySEPKBLTR KRQLELESEL QSLQKANLNL ENLLEATKAC KRQEVSQLMK IHAETLKIIT 
TPTKAYQLHS EPVPKLSPEM GSPGSLYTQN SSILDMDILM EPVPPEMNEQ AFEAISEELR 
TVQEQMSALQ AKLDEEEHKN LKLQQHVDKL EHHSTQMQEL FSSERIDWTK QQEELLSQLN 
VLEKQLQETQ TKNDPLKSEV HDLRWLHSA DKELSSVKLE YSSFKTNQEK EFNKLSERHM 
HVQLQLDNLR LENEKLLESK ACLQDSYDNL QEIMKPEIDQ LSRNLQNFKK ENETLKSDLN 
NLMELLEAEK ERNNKLSLQP EEDKENSSKE ILKVLEAVRQ EKQKETAKCE QQMAKVQKLE 
ESLLATEKVI SSLEKSRDSD KKWADLMNQ IQELRTSVCE KTETIDTLKQ ELKDINCKYM 
SALVDREESR VLIKKQEVDI LDLKBTLRLR ILSEDIERDM LCEDLAHATE QLNMLTEASK 
KHSGLLQSAQ EELTKKEALI QELQHKLNQK KEEVEQKKNE YNPKMRQLEH VMDSAAEDPQ 
SPKTPPHPQT HLAKLLETQE QEIEDGHASK TSLEHLVTKL NEDREVKNAE ILRMKEQLRE 
MENLRDESQQ LIEKNWLLQG QLDDIKKQKE NSDQNHPDNQ QLKNEQEESI KERLAKSKIV 
EEMLKMKADL EEVQSALYNK EMECLRMTDE VERIQTLBSK APQEKEQLRS KLBEMYEERE 
RTSQEMEMLR KQVECLAEEN GKLVGHQNLH QKIQYWRLK KEMVRLAEET EKLRAENVPL 
KEKKRSES 



I I I I I I 

TITTTTTTTT TTTTTTCACA TGCTGATGTT GCTGCCGATG ATTTCMCGC CTGGCTTTGA 

GATTCCGTGA GTAGTCTTGA ATAATTTAAA ATTCGAAAAT CAAATTCTAC TTATTTTCTC 
TTAATGCTAT TGTATTTCCT AATTCTCAGC TTTAACATGT AAGAAAGTAC TTTCGCTAGG 
GGTCTTAATT GAATGGTGGG GTCGAGATGA CTGCGTCAGA ATTAAATCTC TGGAAGACCT 
CTGAGCTCCT TTTAAAATCA TCAACAAGCG AAAATCCTTA TCAATAGCGA TGTGGGAATG 
CATTAGGTAC A 
AATTTAATAA G 
TGAAAATATG A 

AAACTCAGAG GTAAACTTGT GAATACATAA ATCTAAATAA GTCAGTTACC A 
TACGTGATCC T 

TTTCTTTCCC TCTTCCCAAG ATTATCCAAG GCAGTTCCAA TACGCGTTTT CAATAATGGT 
60 AATTAAACTT TTGGAGGGTA ATCGCCAGCT GTTTCCAGAA AAATACTTTT AATAGGTGGT 
AGTCCTCCCT GTAACTGGGT TTTCCCGGGA ATCCGGGGGG GGCATGTTGT TTTTATTGTC 
GCCATTTTTG TTTCGTCTTT TTTGTTTTGT TGGTGGTCAG GTTTTCTTTT CATAGCGGGG 
GAGAAGATTT TGGTCGGACT AGCGCCATCC AGCGGGTTTA GAGAAGGAGC ACACTTITCC 
3 GCTTTTGGAT GAACAATGAA 



1 1 I I I 1 

CCGTTCCGCG CTCTGGCGGC TCCTCCCGGG CGATGCCTCC GCTCTGGGCC CTGCTGGCCC 

TCGGCTGCCT GCGGTTCGGC TCGGCTGTGA ACCTGCAGCC CCAACTGGCC AGTGTGACTT 

TCGCCACCAA CAACCCCACA CTTACCACTG TGGCCTTGGA AAAGCCTCTC TGCATGTTTG 

ACAGCAAAGA GGCCCTCACT GGCACCCACG AGGTCTACCT GTATGTCCTG GTCGACTCAG 

CCATTTCCAG GAATGCCTCA GTGCAAGACA GCACCAACAC CCCACTGGGC TCAACGTTCC 

TACAAACAGA GGGTGGGAGG ACAGGTCCCT AC3UIAGCTGT GGCCTTTGAC CTGATCCCCT 

GCAGTGACCT GCCCAGCCTG GATGCCATTG GGGATGTGTC CAAGGCCTCA CAGATCCTGA 
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AGCTCACCCC ATACTCGACG ATCGACACGT GGCCAGGCCG GCGGAGCGGA GGCATGATCG 

TCATCACTTC CATCCTGGGC TCCCTGCCCT TCTTTCTACT TGTGGGTTTT GCTGGCGCCA 

TCACTCAGGA GGCTGTTCCC AAGTCGCTGG GGGCCTCGGA GTCTTCCTAC ACGTCCGTGA 

ACCGGGGGCC GCCACTGGAC AC3GGCTGSGG TGTATTCCAG CAAGCTCCAA GACTGAGCCC 

AGCACCACCC CTGGGCAGCA GCATCCTCCT CTCTGGCCTT GCCCCAGGCC C 

GGTTGTCACA CCCTGACTTC AC30GAACGTG AAACAGGGCT TGTCCCTCCA A 
AACCCTTAftT AAAATCTTCT GATGAGTTCT A 



1 11 21 31 41 51 

I I I I I I 

MPPLWALLAL GCLRFGSAVN LQPQLASVTP ATNNPTLTTV ALEKPLCMFD SKJ 
VYLWLVDSA ISRNASVQDS TNTPLGSTPL QTEGGRTGPY KAVAPDLIPC SDLPSLDAIG 
DVSKASQILN AYLVRVGANG TCLWDPNFQG LCNAPLSAAT EYRPKYVLVN MSTGLVEDQT 
LWSDPIRTNQ LTPYSTIDTW PGRRSGGMIV ITSILGSLPP PLLVGPftGAI ALSLVDMGSS 
DGETTHDSQI TQEAVPKSLG ASESSYTSVM RGPPLDRAEV YSSKLQD 



Seg ID NOi 168 DNA Sequence 

Nucleic Acid Accession #: NM_005672.1 

coding sequence: 18.. 389 



: AGTGACCATG AAGGCTOTGC TGCTTGCCCT GTTGATGGCA GGCTTGGCCC 

TGCAGCCAGG CACTGCCCTG CTGTGCTACT CCTGCAAAGC CCAGGTGAGC AACGAGGACT 

GCCTGCAGGT GGAGAACIGC ACCCAGCTGG GGGAGCAGTG CIGGACCGCG CGCaTCCGCG 

CAGTTGOCCT CCTGACCGTC ATCAGCAAAG GCTGCAGCTT GAACTGCGTG GATGACTCAC 

AGGACTACTA CGTGGGCAAG AAGAACATCA CGTGCTGTGA CACCGACTTG TGCAACGCCA 

GCGGGGCCCA TGCCCTGCAG CCGGCTGCCG CCATCCTTGC GCTGCTCCCT GCACTCGGCC 

TGCTGCTCTG GGGACCCGGC CAGCTATAGG CTCTGGGGGG CCCCGCTGCA GCCCACACTG 

GGTGTGGTGC CCCAGGCCTT TGTGCCACTC CTCACAGAAC CTOGCCCAGT GGGAGCCTGT 

CCIGGTTCCT GAGGCACATC CTAACWCAAG TTTGACCATG TATGTTTGCA CCCCTTTTCC 

CCNAACCCTG ACCTTCCCAT GGGCCTTTTC CAGGATTCCN ACCNGGCAGA TCAGTTTTAG 

TGANACANAT CCGCNTGCAG ATGGCCCCTC CAACCNTTTN TGTTGNTGTT TCCSTGGCCC 

AGCATTTTCC ACCCTTAACC CTGTGTTCAG GCACTTNTTC CCCCAGGAAG CCTTCCCTGC 

CCACCCCATT TATGAATTGA GCCAGGTTTG GTCCGTGGTG TCCCCCGCAC CCAGCAGGGG 

ACAGGCAATC AGGAGGGCCC AGTAAAGGCT GAGATGAAGT GGACTGAGTA GAACTGGAGG 

ACAAGAGTTG ACGTGAGTTC CTGGGAGTTT CCAGAGATGG GGCCTGGAGG CCTGGAGGAA 

GGGGCCAGGC CTCftCATTTG IGGGGNTCCC GAAIGGCAGC CTGAGCACAG CGTAGGCCCT 
TAATAAACAC CTGTTGGATA A 



I I I I I 1 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TOCTGATTTT 
TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA T 
ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG A 
GGCTGCCTGG ATCGGCATAT TTGTGGGCRT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 
TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 
AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG A 
ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC C 
TGATGACCAG TGGAAAAACA ATGGAGTCaC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 
CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAARRA TACACATCTG C 
TGAGAATAAT GATOCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA A 
AGAACKTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC A 
CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACMCC TGGGGGGTTG CCTGGTTTGG 
ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 
AATTGAATAT TAAGAA 
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PQGLLIPGNV IIGCCGIflLT ABCIPFVSDQ HSLYPLLEAT DNDDIYGAAW 
P CLSVLGIVGI MKSSRKILLA yPILMPIVYA PEVASCITflA TQRDPFTPNL 
PLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW tJRLMLQDNCC 

DADYPWPRQC C 



1 11 21 31 41 51 

ATGAGGCTCC AAAGACCCOS ACAGGCCCCG GCGGGIGGGA GGCGCGCGCC CCGGGGCGGG 
CGGGGCTCCC CCTACCGGCC AGRCCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 

20 AAGGGCGGOO AGCGGGCGCC GCGCGCTGAC CCTCCCTGGG CACOSCTGGG GACGATGGCG 
CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACKC CAACCTGACT 
GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGJ 
TOTCRTGTTT GTGAGAOAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG G 
ACAGAGCCAT ACTGCGTTAT AGCGGCQGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 

25 AAGCAGTGCT COSCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 
CTCCTGGAAO AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 
TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAASGAAT ATGCTGGGAG CATGGGTGAG 
AGCTGTGGIG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 



MRLQRPRQAP AOGRRAPRGG RGSPYRPDPG RGARRLRRPQ KGGEGAPHAD PPWAPLGTMA 
LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 
TEPYCVIAAV KIPPRPPMVA KQCSAGCAAM ERPKPEEKRP LLEEPMPPFY LKCCKIRYCN 
LEGPPINSSV PKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 



CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGAT 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC A 
ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 
AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAAT3CATG 
CTGTCAGCGT TGGTATTTCA CATTCAATGG A 



CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT 



TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 
TGGTTAGAAT ACTTTCTTCA TAGTCACaTT CTCTCAACCT ATAATTTGGA ATATTGTTGT 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TITTAAAAAA ATATAAAAGC TACCAATCTT 
TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 
CAACCTTAAA AAAAAAAAAA AAAA 



MRPQQPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 
GVPGRDGSPG ANGIPGTPGI PGRDGPKGEK GBCLRESPEE SWTPNYKQCS WSSLNYGIDL 
GKIAECTPTK MRSNSALRVL PSGSLRLKCR NACCQRWYPT FNGAECSGPL PIEAIIYLDQ 
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L CEGIGAGLVD VAIWVGTCSD yPKGDASTGW NSVSRIIIEE 240 
LPK I 

Seg ID NO! 176 DNA Sequence 



ATGGCGAGGA 



GAATCTGGCA TTAATGTTGA CTTGGCA.\TT TCCACACGGC A 

T ATGGAGAAAA TAATTC7TTG TCAGTTGAAG GGTTCAGAAA ATTACTTCAA 



C ATCACTCAGA C' 
GAGCATCACT CTGACCATGA TCATCACTCC CACCAT.AATC ATGCTGCTTC T 
AAGCGAAAAG CTCTTTGCCC AGACCATGAC TCAGATAGTT CAGGTAAAGA TCCTAGAAAC 
AGCCAGGGGA AAGGAGCTCA CCGACCAGAA CATGCCAGTG G 
G CTAGTGAAGT G 

lAACrCT TCCCCAAAGA TGTAAGCAGC 
TCCACTCCAC CCAGTGTCAC ATCAAAGAGC CGGGTGAGCC GGCTGGCTGG TAGOAAAACA 
C TTTATGTATT CCAGAAACAC AAATGAAAAT 
iCATCTC ATGGCATGGG C 

CCGCTGAATG CAACAGAGTT CAACTATCTC T' 

G AAGGCTGAAA TCCCTCCAAA GACCTATTCA 9S0 

A GCCATTTCCA TCATCAGTTT CCTGTCTCTG 102 0 

CIGGGGGTTA TCTTAGTGCC TCTCATGAAT CGGGTGnTT TCAAATTTCT CCTGAGTTTC 1080 

CTTGTGGCAC TGGCCGTTGG GACTTTGAGT GGTGATGCTT TTTTACACCT TCTTCCACAT 1140 

TCTCATGCAA GTCACCACCA TAGTCATAGC CATGAAGAAC CAGCAATGGA AATGAAAAGA 1200 

GOACCACTTT TCAGTCATCT GTCTTCTCAA AACATAGAAG AAAGTGCCTA TTTTGATTCC 1260 

AOSTGGftAGG GTCTAACAGC TCTAGGAGGC CTGTATTTCA TGTTTCTTGT TGAACATGTC 1320 

CTCACATTGA TCAAACAATT TAAAGATAAG AAGAAAAAGA ATCAGAAGAA ACCTGAAAAT 1380 

GATGATGATG TGGAGATTAA GAAGCAGTTG TCCAAGTATG AATCTCAACT TTCAACAAAT 1440 

GAGGAGAAAG TAGATACAGA TGATCGAACT GAAGGCTATT TACGAGCAGA CTCACAAGAG 1500 

CCCTCCCACT TTGATTCTCA GCAGCCTGCA GTCTTGGAAG AAGAAQAGOI CATGATAGCT 1560 

CATGCTCATC CACAGGAAGT CTACAATGAA TATGTACCCA GAGGGTGCAA GAATAAATGC ^620 

CATTCACATT TCCACGATAC ACTCOGCCAG TCAGACGATC TCATTCACCA CCATCATGAC 1680 

TACCATCATA TTCTCCATCA TCACCACCAC CAAAACCACC ATCCTCACAG TCACAGCCAG 1740 

CQCTACTCTC GGGAGGAGCT GAAAGATGCC GGCGTCGCCA CTTTGGCCTG GATGGTGATA 1800 

ATGGGTGATG GCCTGCACAA TTTCRGCGAT GGCCTAGCAA TTGGTGCTGC TTTTACTGAA 1860 

GGCTTATCAA GTGGTTTAAG TACTTCTGTT GCTGTGTTCT GTCATGAGTT GCCTCATGAA 1920 

TTAGGTGACT TTGCTGTTCT ACTAAAGGCT GGCATGACCG TTAAGCAGGC TGTCCTTTAT 1980 

AATGCATTGT CAGCCATGCT GGCGTATCTT GGAATGGCAA CAGGAATTTT CATTGGTCAT 2040 

TATGCTGAAA ATGTTTCTAT GTGGATATTT GCACTTACTG CTGGCTTATT CATGTATGTT 2100 

GCTCTGGTTG ATATGGTACC TGAAATGCTG CACAATGATG CTAGTGACCA TGGATGTAGC 2160 

CGCTGGGGGT ATTTCTTTTT ACAGAATGCT GGGATGCTTT TGGGTTTTGG AATTATGTTA 2220 
CTTATTTCCA TATTTGAACA TAAAATOGTG TTTCGTATAA ATTTCTAG 



1 11 21 31 41 51 

I I I I I I 

MARKLSVILI LTPALSVTNP L""" ™~t,„^„™., ^^.z-t.^ttm ,t o™.. 

FYRYGENNSL SVEGFRKLLQ N 
EHHSDHDHHS HHNHAASGKN K 
SVSASEVTST VYNTVSE 

NESVSEPRKG FMYSRNTNEN PQECFNASKL LISHG 
RSCLIHTSEK KAEIPPKTYS LQIAWYGGFI AISII 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAI-lEMra GPLFSHLSSQ N 
TWKGLTALGG LVFMFLVEHV LTLIKQFi<DK XKKJJQKKPEN DDDVEIKKQL SKYESQLSTN 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEE3VMIA HAHPQEVYNE YVPRGCKNKC 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSKSC RYSREELKDA GVATLAWMVI 
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDPAVLLKA GMTVKQAVLY 
NALSAMLAYL GMATGIFIGH YAENVSMWIP ALTAGLPMYV ALVDMVPEML HNDASDHGCS 
A GMLLGPGIML L 



CACAGCGTGG GAAGCAGCTC TGGGGGAGCT 
AGCTACGGCT GGGTGTGTAG AACGGGGCCG 



CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT CCTGCCT 
TCTGCSiGCCG GCTCCCAGGG AGATCTCGGT GGAACTTCAG A 
TTCAACCATG CCCCTGTCCC TGGGAGCCGA GATGTGGGGG CCTGAGGCCT GGCTGCTGCT 
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AAAACTGCCC TGCTTCTACC 




TGGACACGGC T 

GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200 

GTTCTTGACC CCCAGGAAGA 12 60 

GTC3GTGC3GTG TGATCGCCGC 1320 

TCCCGATACC ATCGGCGCAA 1380 

ATGACCCAGA AATATGAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440 

20 CCGSAGGCTG CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCGAG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560 

AGAQCCCGAG GOCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620 

TGAACTGCTG TCTCCAGGCT CTGGGOSGGC CGAGGAGGAG GAAGATCAGG ATGAAGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACGGG 1740 

25 CAATOGCATC TACATCAATG GGCGGGGACA CCTGGTCTCA CCCAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG CTCCTTCTGT TGACATOGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860 

ACACCCCCAT TTCTTGCGGA AQATGCICCC CATCCCACIG ACTGCTTGAC CTTTACCTCC 1920 

AACCKTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTQT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 

30 TGACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100 

AAOTGAACTG TGGTGTATGT GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGA7VAABGCA GGTATTTTCT 2220 

CAGACCCCAG AGCASIATTA ATGATGCAGA GGTIGGAGGA GAGAGGTGGA GACTGTGGCT 2280 

CAGACCCAGG TGTGOGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340 

35 TCTCCTACCA CTTCGGAQCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCICTGGGCC TGCTGCATGT 2460 

ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520 

ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGG ACGATATAGC CAGACCCTGT 2640 

40 

45 1 11 21 31 41 51 

I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TWLGQDAKL PCFYRGDSGE 
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPENPLDG SVLLRNAVQA 
DEGEVECRVS TPPAGSPQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 

50 VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 
HVSFLAEASV RGLEDCNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV V\rvGVIAALL 
FCLLWVWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 

55 AMNHFVQENG TLRAKPTGNG I 



I I I I 

A ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 
TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 
TGCATTCAGC AGCTTCCAAA T3GTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 
CTTTATQAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 
ATTACAGATA ACGTCSGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 
CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCI CTITGCAATT GTGAGGAAAA 
AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 
TGTGCCACCA TCACGACCTT TCKAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 
TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 
TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCaT GGACTGGAAA ACCTGGAATT 
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 
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GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA 
ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG 



CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGT 

rTCCTTGT A.ACTGCAAAG 



r GTTCCTTGGT C 

ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 
=1 ATAACAACCT C 
3 TAAATCTTAA A 

3 ATTTACTAAC CCAGATTGAC CTTGAGGATA A 

GACAGATGAC ATCCTCTGCA CTTCCCCCGG GmTCTCGAC AAAAAGGAAT T 
^ ATTCTCTGTC CAGGTTTAGT AA.ATAACCCA TCCATGCCAA C 

C CTGCAACAAC AACAAATACG GCIGATRCTA TTTTAOGATC 2280 

TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

AAAOAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 

GTATCGCCAT AAAACCACTC ATCACRCTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 

GCACATGGTG AGCCCCATGQ TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2530 

GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 

GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 

AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAARCATTGC 2820 

TCAGCTCCAG CCTGATATGO AGOCACATTA TCCTGGAGCC CACMAAGAGC TGAAGTTAAT 2830 

GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 

TTTTCAAOTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 

AACATAGATG GAGAGTTTGa GGOCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 

ACCTTGTAAA TAAQTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 

AACIATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 

GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAARTC AATCCTTGCA TGTAAATC 3238 

Seg ID NO: 181 Protein Sequence 



I I I I I 

MKLKIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA » 

VPPSRPFQLS LLNNQLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 

HNSLEILKED TFHGLENLEF LQADNNFITV lEPSAFSKLN RLKVLILKDN AIESLPPNIF 

RFVPLTHLDL RGNQLOTLPV VGFLEHIGRI LDLQLEDNKW ACKCDLLQLK TWLENMPPQS 

IIGDWCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 

TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIESLSDLRP 

PPQNPRKLIL AGKIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 

NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 

GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVQLQQ WIQKLSKNTV 

TDDILCTSPG HLDKKBLKAL NSEILCPQLV NNPSMPTQTS YLMVTTPATT TNTSDTILRS 

LTDAVPLSVL ILGLLIMFIT IVFCAAGIW LVLHRRRRVK KKQVDEQMRD NSPVHLQYSM 

YGHRTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNBKE GSDAKHLQRS 

LLEQEHHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 

QLQPDMBAHY POAHEELKLM ETLMYSRPRK VLVEQTKNEY PELKANLHAE PI 



I I I I I 

ATGGCGGGTG CGGGCCCGAA GCGGCGCGCG CTAGCGGCGC CGGCGGCCGA G 
GAGGCGCGGG AGAAOATGCT GGCCGCCAAG AGCGCGGACG GCTCGGCGCC GGCAGGCGAG 
GGCGAGGGCG TGACCCTGCA GCGGAACATC ACGCTGCTCA ACGGCGTGGC CATCfiTCGTG 
GGGACCATTA TCGGCTCGGG CATCTTCGTG ACGCCCACGG GCGTGCTCAA GGAGGCAGGC 
TCGCCGGGGC TGQCGCTGGT GGTGTGGGCC GCGTGCGGCG TCTTCTCCAT CGTGGGCGCG 
CTCTGCTACG CGGAGCTCGG CACCACCATC TCCAAATCGG GOGGCGACTA CGCCTACATG 
CTGGAGGTCT ACGGCTCGCT GCCCGCCTTC CTCMGCTCT GGATCGAGCT GCTtaTCATC 
CGGCCTTCAT CGCAGTACAT CQTGGCCCTQ GTCTTCGCCA COTACCTGCT CAAGCCGCTC 
TTCCCCACCT GCCCGGTGCC CGAGGAGGCA GCCAAGCTOG TGGCCTGCCT CTG03TGCTG 
CTGCTCAOGG CCGTGAACTG CTACAGCGTG AAGGCCGCCA CCCGGGTCCA GGATGCCTTT 
GCCGCOGCCA AQCTCCTGQC CCTGGCCCTQ ATCATCCTGC TGGGCTTCGT C' 
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ACTTCGGQ.!^. CTATCACCTG 

r TCACATCCTC CAGGCTCTTC TTCGTCGGGT 
TCCATCCTCT CCATGATCCA CCCACAGCTC CTCACCCCCG TGCCGTCCCT CGTGTTCACG 
TGTGTGATGA CGCTGCTCTA CGCCTTCTCC AAGGACATCT TCTCCGTCAT CAACTTCTTC 
AGCTTCTTCA ACTGGCTCTG CGTGGCCCTG GCCATCATCG GCATGATCTG GCTGOSCCAC 
AGAAAGCCTG AGCTTQAGCG GCCCATCAAG GTGAACCTGG CCCTGCCTGT GTTCTTCATC 
CTGGCCTGCC TCTTCCTGAT CGCCGTCTCC TTCTGGAAGA CACCQGTGGA GTGTGGCATC 
GGCTTCACCA TCATCCTCAG CGGGCTGCCC GTCTACTTCT TOSGGGTCTG GTGGAAAAAC 
AAGCCCAAGT GGCTCCTCCA GGGCATCTTC TCCACGACCG TCCTGTGTCA GftAGCTCATG 



HAGAGPKBRA LAAFAAEEKE EAKEKMLAAK S 

GTIIGSGIPV TPTGVLKEAG SPGLALWWA ACGVFSIVGA LCYAELGTTI S 
LEVYGSLPAP LKLWIELLII RPSSQYIVM. VPATYLLKPL F 
LLTAVNCYSV KAATEVQDAF AAAKLLALAL IILLGPVQIG KGDVSNLDPN F 
GNIVLALVSG LPAVGGWNYL NPVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYPTTL 
STEQMLSSEA VAVDPGNYHL GVMSWIIPVF VGL3CFGSVN GSLPTSSRI* PVGSREGHLP 
SILSMIHPQL LTPVPSLVPT CVMTLLYAPS KDIPSVINFP SFFNWLCVAL AIIGMIWLRH 
RKPELBRPIK VNLALPVPPI LACLPLIAVS PWKTPVECGI OPTIILSGLE VYPPGVWWKN 



30 KPKWLLQGIP STTVLCQKLM QWPQET 

35 



Seq ID NO: 184 DHA Sequence 
Nucleic Acid Accession #: NM_00 
coding sequence: 1..822 



21 31 41 51 

11 1 1 

A GTATCTTTGA GGGACTCCTG AGTGGGGTCA ACAAGTACTC CACAGCCTTT 
QGGCGCATCT GGCTGTCTCT GGTCTTCATC TTCCGCGTGC TGGTGTACCT GGTGACGGCC 
QASCGTGTGT GGAGTGATGA CCACAAGGAC TTCGACTGCA ATACTCGCCA GCCCGGCTGC 
TCCAACGTCT OCTTTGATGA GTTCTTCCCT GTGTCCCATG TGCGCCTCTG GGCCCTGCAG 
CTTATCCTGG TGACATGCCC CTCACTGCTC GTGGTCATGC ACGTGOCCTA CCGOGAGOTT 
CAGGAGAAGA GGCACCGAGA AGCCCATGGG GAGAACAGTG GGOSCCTCTA CCTGAACCCC 
GGCAAGAAGC GGGGTGGGCT CTGGTGGACA TATGTCTGCA GCCTAGTGTT CAAGGCGAGC 
GTGGACATCG CCTTTCTCTA TGTGTTCCAC TCATTCTACC CCAAATATAT CCTCCCTCCT 
GTGGTCAAGT GCCACGCAGA TCCATGTCCC AATATAGTGG ACTGCTTCAT CTCCAAGCCC 

AACCTCCTGG AGCTCATCTA CCTGGTGAGC AAGAGATGCC ACGAGTGCCI GGCAGCAAGG 

3 CCATGTGCAC AGGTCATCAC CCCCACGGTA CCACCTCTTC CTGCAAACAA 

TTTCGGGTGA CCTCATCTTT CTGGGCTCAG AC3W3TCATCC TCCTCTCTTA 
\ RCCATCTTGT GA 



FRVLVYLVTA ERVWSDDHKD PDCNTRQPGC 
WMHVAYREV QEKKHREAHG ENSGRLYLNP 
SPYPKYILPP WKCHADPCP NIVDCFISKP 
NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 

,GSDSHPPLL E 



1111 
CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCBAGGC TTCCTCCTCC TCACCCTCCT 
CQCCCTGCTG GCGCTCACCT CCGCGGTCGC CA7VAARGAAA GATAAGGTGA AGAAGGGCOG 
CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 
CGGOSTGGGT TTCCGCGAGG GCACCTGGGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 
GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 
TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 
CAATGCTCAG TGCCAGGAGA CCATCCGCX3T CACCAAGCCC TGCACCCCCa AGACCAAAGC 
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACX3 CCAAGCCTGG ATGCCAAGGA 
GCCCOIGGTQ TCACATQQGG CCTGGCCACG CCCTCCCTCT CCCAGGCCOG A 
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10 

15 
20 



CACCAGTGCC TTCTGTCTCC TCGTTAGCTTT TAATCAATCA TQCCCTQCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TOAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAOCCCOCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTSAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 



MQHRGPLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREX3T 
CGAQTQRIEC RVPCNWKKEP GADCKVKPEN WGACDGGTGT KVEQGTLKKA RYNAQOQETI 
RVTKPOTPKT KAKAKAKKGK GKD 



ATGCCTCTGC CCTGGAGCCT TGCGCTCCCX3 C 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACQTCA GCCTGGGGTC 
TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGA^iiACAG CAAGGGAGTC 
TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGC-ACC A 
AGATQCTTTC CAGGATACAC CGGGAAAACC T 



: ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 
ATAAACIGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CaTTTGGAAG CTACTACTGC 
AAATGTCACA TTGGTTTOGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 
AATGAATGTA CTATGGATAG CCATAOSTGC AGCCACCATG CCAATTGCTT CAATACCCAA 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAOCACCTG GTACCATCAA AGACAGAATC 
AAGAAGTTGC TTGCTCACSiA AAACAGCATG AAAAAQAAGG CAAAAATTAA AAATGTTACC 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTICAA CTATGAAGAG 
ATAGTTTCCA GAGGCOGGAA CTCTCaTGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 
QAQaaQCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCQA 
AGCCTGCGAG GAGATGTGTT TITCCCTAAG GTGAATGAAG CRGGTGAATT CGGCCTGAIT 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACaTA AAGATTTAAA TATCTCGGIT 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 
TGGAATCCTG CTGATCGAGA TAATGCTATI GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 
A AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 
I TOCTCTTTGA TTACCGOCTG GCOSGAGACA AAOTCGGGAA ACTTCGAGTG 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACQAGTGA GGATGAAAAG 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 
GAAGCAGAAC GTGGCAAGGO CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 
TCAQGCTTAT GTCCAGATAQ CCTTTTATCT GTGQATGACT GAATGTTACT ATCTTTATAT 
rrGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 



I I I I I 

MPLPWSLALP LLLSWVAGGP GNAASAEHHG LLASARQPGV C 
CEATCEPGCK FGECVGPKKC RCPPGYTGKT CSQDVNECGM KPRPCQHRCV N 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRliAPKGR DCLDIDECAS 
GKVICPVNRR CVMTFGSYVC KCHIGFELQSf ISGRYDCIDI NECTMD3HTC SHHANCFNTQ 
GSFKCKCKQG YKGKGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKir/T 
PEPTRTPTPK VNLQPFNVEE IVSRGGNSHG QKKQNEEKMK EGLEDEKEEE KALKNDIEER 
SLRGDVPPPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDPD 
WNPADRDNAI GPYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIP EAERGKGKTQ EIAVDGVLLV 
S VDD 
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AGAGACTCAA GATGATTCCC 

r CATTATGACA 



CCATCTGTGG A 
ATQAGAATGG AAGGAATGAA AQGCTGCCCA GCAQTTTTGC 



ATCCTTCACT I 
AACTTGGATT CTQATATCCG TAGAGGTTTG QAQAGCAACG 



TTCAAGACTT CATTGAAGCA GAAGATGACC TTTCATCTIT TAGAGCAGCT GCCATCACAT 
OGGACATATT GGAGGCCCTT GGAAGAGACG GTCACTTCAC ACTCTTTGCT CCCACCAATG 
AGGCTTTTGA GAAACITCCA CGAGGTGTCC TAGARAGGIT CATGGGAGAC ARAQTQGCTT 
CaSSAGCTCT TATGAAGTAC CACaTCTTAA ATACTCTCCA GTGTTCTGAG TCTATTATGG 
GAGGAGCAGT CTTTGAGACX3 CTGGAAGGAA ATACAATTQA GATflCGATGT GACGGTQACA 
GTATAACAGT AAATGGAATC AAAATGGTGA ACAAftAAGGA TATTGTGACA AATAATGGTG 
TGATCCATIT GATTGATCAG GTCCTAATTC CTGATTCTGC CAAACAAGTT ATTGAGCTGG 
CTGGAAAACA GCAAACCACC TTCaCGQATC TTGTGGCCCA ATTAGGCTTG GCATCTGCTC 
TGAGGCCAGA TGGAGAATAC ACTTTGCTGG CACCTGTGAA TAATGCATTT TCTGATGATA 
CTCTCAGCAT GGTTCAGCGC CTCCTTAAAT TAATTCTGCA GAATCACSTA TTGAAAGTAA 
AAQTTGGCKT TAATGAGCTT TACSUCGGGC AAATACTGGA AACCATCGGA GGCAAACAGC 
TCAGAGTCTT CGTATATCGT ACAGCTGTCT GCATTGAAAA TTCATGCATG GAGAAAGOGA 
GTAAGCAAGG GAGAAACGGT GCGATTCACA TATTCCGCGA GATCATCAAG CXAGCAGAGA 
AATCCCTCCA TGAAAAGTTA AAACAAGATA AGCGCTTTAG CACCTTCCTC AGCCTACTTG 
AAGCTGCAGA CTTGAAAGAG CTCCTGACAC AACCTGGAGA CTGOACATTA TTTGTGCCAA 
CCAATGATGC TTTTAAGGGA ATGACTAGTG AAGAAAAAGA AATTCTGATA CGGGACAAAA 
ATOCTCTTCA AAACATCATT CTTTATCACC TGACACCAGG AGTTTTCATT GGAAAAGGAT 
TTOAACCTGG TGTTACTAAC ATTTTAAAGA CCACACAAGG AAGCAAAATC TTTCTGAAAG 
AAGTAAATGA TACACTTCTG GTGAATGAAT TGAAATCAAA AGAATCTGAC ATCATGACAA 
CAAATGGTGT AATTCATGTT GTAGATAAAC TCCTCTATCC AGCAGACACA CCTGTTGGAA 
ATGATCAACT GCTGGAAATA CTTAATAAAT TAATCAAATA CATCCAAATT AAGTTTGTTC 
GTGGTAGCAC CTTCAAAGAA ATCCCCGTGA CTGTCTATAC AACTAAAATT ATAACCAAAG 2040 
TTGTGGAACC AAAAATTAAA GTGATTQAAG GCAGTCTTCA GCCTATTATC AAAACTGAAG 2100 
GACCCACACT AACAAAAGTC AAAATTGAAG GTGAACCTGA ATTCAGACTG ATTAAAGAAG 2160 
GTGAAACAAT AACTGAAGTG ATCCATGGAG AGCCAATTAT TAAAAAATAC ACCAAAATCA 2220 
TTGATGGAGT GCCTGTGGAA ATAACTGAAA AAGAGACACG AGAAGAACGA A 
GTCCTGAAAT AAAATACACT AGGATTICTA CTGGAGC 
AQAAATTGTT ACAAGAAGAG GTCACCAAGG TCACCAAATT C 
ATTTATTTGA AGATGAAGAA ATTAAAAGAC TGCTTCAGGG A 
TGCAAGCCAA CAAAAAAGTT CAAGOTTCTA GAAGACC 
GAAAATCCAA AAACCAGAAA AAAATGTTTA TACAACCCTA A 

k GAGCCAAGTT GACTTCAGGA ACTGAAACAT CAGCAC.WIG A 



TGTGGAGTTA GCCTCCTGTG GTAAAGGAAT T 
TCATCTTGAC ATTAAAAGTT CTGGCTAACT TTGGAATCCA TTAGAGM 
ATCGAAGAGT TGTGAACTGT T 



QCIAAGTTAT 

ATTTCTATAT GAGTGGGTTT 
CTCAGAATGT TTGTCATATG CTTCTTGCAA 
AAAACCAITT TTCAGATATA AAGAGAATTA CTTCAAATTG AGTAATTCAG 



1 11 21 31 41 51 

I I I I I I 

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKK5fFSTC 
KNWVKKSICG QKTTVLYBCC PGYMRMBGMK GCPAVLPIDH VXGTLGIVGA TTTQRYSDAS 
KIiRBEISGKG SFTYFAPSlslE AWDNLDSDIR RGIiESNVMVE IiUJALKSHMI MKKMLTimLK 
NGMIIPSMYN NLGLPINHYP NGWTVNCAR IIHGNQIATN GWHVICRVL TQIGTSIQDF 
lEAEDDLSSP HAAAITSDIL EALGRDOHPT LPAPTNEAPE KLPRGVLERP MODKVASEAL 
MKYHILNTLQ CSESIMGGAV PETLEGNTIE IGCBGDSITV NGIKMVNKKD IVTNNGVIHL 
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 
9QRLLKLILQ NHILKVKVGL MELYNGQILE TIGGKQLRVF VYETAVCIEN SCMEKGSKQG 
RNGAIHIFRE IIKPAEKSLH EKLKQDKRPS TPLSLLBAAD LKELLTQPGD WTLPVPTHDA 
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VPIGKGFEPG VTNILKTTQG SKIPLKEVND 
TLLVNELKSK ESDIMTTNGV IHWDKLLYP ADTPVGHDQL LEILNKLIKY IQIKFVRGST 
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r TKIITKWEP KIKVIEGSLQ PIIKTBGPTL TKVKIEGEPE PRLIKEGETI 
I KKYTKIIDGV PVEITEKETR EERIITGPEI KVTRISTGGG ETEETLKKLL 

QEEVTKVTKF lEGGDGHLPE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 



I I I I 

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG 
AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 
GACGGGCGTC TGCGGCTOGC GCGACTAGCG CTGGTACTCC TGGGCTGG3T CTCCTCGTCT 
r CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 
C GGACCAGTGC CCCGCGCTGT GCX3AGTGCTC CGAGGCAGCG 
\ CCGCAATCTG ACCGAGGTGC CX31CX3GACCT GCCCGCC7IAC 
GTGCGCAACC TCTTCCTTAC CX3GCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 
CGCCGGCCGC CGCTGGCGGA GCTGGCCGCXS CTCRACCTCA GCGGCAGCOG CCTGGACGAG 
GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC flGCrOGACCT CAGCCACAAC 
CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TOGGGCaGCA ATGCCAGCGT CTCGGCCCCC 
AGTCCCCTTG TGGAACTGaT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 
OGGAGCTTOG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCOGTGCACT GCAGGGGCTC 
OGOOSCITGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCXaVA 
CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTOGC TGGTGAGCCT GACCTAOSTG 
TCCTTCOQCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACaATGC CCTCAAGGTC 
CTTCACAATG QCACCCTOaC TGAGTTG<»A GGTCTACCCC ACATTAGGGT TTTCCTGGAC 
AACAATCCCT GGGTCTGCGA CTGCCACATG GCftGACRTGG TGACCTGGCT CAAGGAAACA 
GAGGTAGTGC AGGGCAAAOA CCGGCTCACC TGTGCATATC OGGAAAAAAT GAGGAATCGG 
QTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCXITG 
CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGG03CTAT TTTCCTCCTG 
GTTTTGTATT TGAACCGCAA GGOGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 
AOOGATCACA TGGAAGGGTA TCRTTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 
AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 
CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 
TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 
TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCETC 
TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 
TGGGCTTCTT GCTGTCTGTC TCTCTCTCAO TACaWSTTCSUl GGTGTAOCAA GTGTACCCAC 
ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 
TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 
CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 
AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACARAAA ATAACTTGCA ACTTCATAAC 
TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGA.WlTGT ACTGATTTTT 
TTTTAATAAA CTGCATCOAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 
ATTCTTAAAA GAA 



I t I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLQWVS SSSPTSSASS FSSSAPFLAS AVSAQPP 
QCPALCECSB AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA 
AALULSGSRL DEVRAGAFEK LPSLRQLDLS HNPLADLSPF APSGSMASVS A 
KHIVPPEDER QNRSPEa^WV AALIAQRALQ QLRRLELASN HPLVLPRDVL AQLPSLRHLD 
LSMNSLVSLT YVSFENLTHL ESLHLBDNAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 
HMADMVTWLK ETEWQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 
IVLALIGAIP LLVLYLNRKG IKKWMHNIRD ACRDHMBGYH YRYEINADPR LTNLSSNSDV 



Seq ID NO: 194 DNA Sequence 
Nucleic Acid Accession #i NM_014400 
Coding sequences 86.. 1126 



1 I I 1 1 1 

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACC 
GAOGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 
GATCTGGACT GCAGGCTGGC IGCTGCTGCT GCTGCTTCGC GGAGGAGOSC AGGCCCTGGA 
GTBCTACAGC TGCSTGCAGA AAGCAGATGA CGGATGCTCC COSAACAAGA TGAAGACAGT 
GAAGTGCSCG CCGGGCGTGG ACGTCTGCaC CGAGGCOGTG GQGGCGGTGG AGACCATCCA 
CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTOSGGA CTCCCCGGCA AGAATGACCG 
CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCXaGCTG CAGCAATGCG CTCAGGATCG 
CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 
ATACCCQCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGQ AGGCGTGCCA 
GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCI ACAAGGGCTG 
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TGGCTCCTGT TGCCAGGGOT CCCBCTOTAA CTCIGACCTC CGCAACAftGA CCTACTTCTC 
CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CXTACGRCTG TGGCCTCAAC 
CaCATCTOTC ACCACTTCTA CCTOGOCCCC AGTGAGACCC ACATCCACCA CKAAACCCAT 
GCCAG03CCA ACCAGTCAQA CTCOGAQACA GGGAGTAGAA CAOQAGGCCT CCCGGGATQA 
GGAGCCCAGG TTGACTGGAG G 

TCCTGCAAAA CGGGGGCCCC AOCAGCCCCA TAATAAAGGC T 
ATTGGCAQCC CTTCTQTTGQ CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 
AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 
CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260 
CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320 
3 GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380 



AGGATGCTAA GCTTCCTACT CACTTTCTCC T 

ATCGGTTCCC CATATGTCTT CCTTACTAGA C 
CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680 



I I I I I I 

MDPARKAGAQ SMIWTAGWLL LIiLLRGQAQA LECXSCVQKA DDGCSPl 

CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 
SRALDPAGNE SAVPPNGVEC YSCVGLSEEA CQGTSPPWS CYNASDHVYK GCPDGNVTLT 
AANVTVSLPV RGCVQDEFCT RDGVTGPGPT LSGSCCQGSR CNSDIiRNKTY FSPRIPPLVR 
LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASE DEEPRLTGGA 
AOHQDRSNSG QVPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 



I I I I I I 

ACCTAAAACC ITGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAQG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTOACTC TCCTQGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGQGTATAA TQGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GASGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTOGC TACGGATCAC GAGGCCGAOT GTTTGTCCAT SOO 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAOAA AACTOTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCaCCT TTATCTACAA TAGCACCCAA AATGCAACTG CaTCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 
CTACAGAACC AGATGTGCAG CCTCAGAAOT OCATGGGATG TAATCACAGA CTCTGCTGAC 
TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 
GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 
GCTGACAOAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 
ATTCATACCT TCGTGGCCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAOCTA 
CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 
A ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 



3 ATGCTTTCAG TAGAATTTCC 
G TGAAAATGTC 
G CAACGACACT 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 
\ TACCCATCAT 

G CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 
GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 
70 TATGCCRATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 
GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAG 
GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCITTGCTG 
TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 
CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 
75 GCTOCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 
AGCTCAQGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 
CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 
TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 
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AGTAAAAGTC TACAGAATAT 

A CGTTCTCACC 



G GAACTCCTTA C 

^ ATCATTTGCC TTATTATAGT TGTGACACAT 
ATAAATATCC AAAGTGTCTT CCTTCTTAGA T 

CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGT 
ATACAGATAA GATTTTTACA T 

r TTAAAGTAAT GTCTTTAAAG 3180 

A GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTCTTTTATT GAGGTGGAAA 32 40 

3 GTCTGCATTA TAACTGTCIG TGTGAAGCAA 3 3 00 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3350 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAAIGCAARG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTRATGCTC AGAGATCTTT 3 4 BO 

TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTIGTCTCT TCSTACCGGT 3540 

TTTATGACAA AGGTCTATTG SATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

r TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCK TATAATGCCT 3660 
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MTQRSIAGPI CWLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 
IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 
GDDPlfTLQYR GCGKEGKlflH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNMD 
KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIV NSTQNATASI 
MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 
TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG lASFDSKGEI 
RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF EWEKLNGKA YGSVMILVTS 
GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 
SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDIMFLVTW QASGPPEIIL 
FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 
AVPPATVEAF VERDSLHFPH PVMIYAMVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 
AGADVIKNDG lYSRYFFSFA ANGRYSLKVH VNHSP5ISTP AHSIPGSHAM YVPGYTANGH 
IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKIl DLEAVKVEEE 
LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKSNPQQ AGIREIFTFS 
PQISTNGPEH QPNGETHESH HIYVAIRAMD RNSLQSAVSN lAQAPLFIPP NSDPVPAHDY 
LILKGVLTAM GLIGIICLIl WTHHTLSRK KRSDKKENGT KLL 



TTTTCTTAGA CATTAACIGC AGACGGCTGG CAGGATAGAA GCAGCOGCTC ACTTGGACTT 
TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 
A TTGGITCATG GAGAATIGCG A 



AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 
TTACITCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCICTGGA GTGGGAATCG 
ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 
CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGICGGGCT CTAAATGCCC 
AAGGACTAGA TGTAGAGAAA CCACITATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 
ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 
ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 
T CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 
^GTCCGT ACTTIGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 
I TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 
GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 
CAGCACGTAT TGAAGAAAAT ATTTIAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 
TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 
GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 
AGGCICTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 
CTGAATTTCa CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 
AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 
AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 
ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATQGGACGT AAOSATGGTG 
GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 
ATTCIACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTQGCCATA GATGAATACA 
CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 
CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCaGTTCTTC ACCITCCXSTG CTIGTCTCCG 
CTAGRACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCRACKTG 
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TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAftCA GATACCTCCT GGAGTATACC ACATCTCCCT GSIACTTACA GACAGTCAGA IBOO 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

5 CAGGSAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCAOiA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

10 TGGAAGGCAC TTCAGGAATG GAAATGRCCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GRGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 
TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA A 

15 TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCTTGTGGGC TCCGTGGGTT 2580 
GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT O 
TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA G 
CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA T 
CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG A 

;CCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT A 
C TTACTCGGCT T 

C ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT T 
CTGGCAACCT AGCTGGCCCA ACGCAGCTAC CAGGGTCACA TACTATGCTC T 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 312 0 

C TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

;TTCTCT CATAAACTGA TCAOSATTAT AAATTAARTG TTTGGGTTCA 3240 

A GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 



1 11 21 31 41 51 

I I I I I I 

MMGLFPRTTG ALAIPVWIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE W" 
GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPPGIF WDKNTGDIN I 
PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVPSQQ IFMGEIEENS AaribbVMiljrj 
ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDHEQA SSYRLWSGA 
DKDGEGLSTQ CEC3JIKVKDV NDNPPMPRDS QYSARIEENI LSSELLRPQV TDLDEEYTDN 
WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEPHQSVI 
SHYHVQSTPV TIQVINVREG lAFRPASKTF TVQKGISSKK LVDYILGTYQ flIDEDTNKAA 
SNVKYVMGRN DGGYLMIDSK TAEIKPVKHM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 
D NCPTAVLEKD AVCSSSPSW VSARTLHNRY TGPYTFALED QPVKLPAVWS 
L LRAQBQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 
TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCX3AGSTGG VTGGFIPVPD 
GSEGTIHQWG lEGAHPEDKE ITNICVPPVT ANGADPMESS EWCTHTYARG TAVEGTSGME 
MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 
DGAISMNPLD SYPSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 
LDDSPLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG lESCGHPIEV QQTGPVKCQT 
LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 
3 SVPGNLAGPT QLRGSHTMLC TEDPCSRLl 



3 GACAGGGCAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 
3 GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 
ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA AOSCGGCGGA GCTGTGAGCC 

3GATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 

S AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 



TTTGACTTAA 
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SGVKVKIIPK EEHCKMPEAG EEQPQV 

Seq ID NO. 202 DNA Sequence 
Nucleic Acid Accession AA172056 
Coding sequence: 121.. 339 



TTTAGCCACC AGAGGANTTC TCTTGAAATA CCCAAAATCC ATCAGTATCT TGAATCATGC 
TGGATTTTGA AGAATTCTTA AGAAGCCATG TAAAGGGGGC T 
ATGTTTTTTA TACAGAAAGG AGAATGCAGA ATGGTCAGAC TATCATGCAC T 
GATTTCAAGA AATTACAGGA AAACTTTCCA AAGTTCC 

A AGATAAGTTT AGTTTTATGG AAGACTTTTA T' 



TCTGQAAGAA CAATCAAATA 



G TTATTTCATC TGAAAATTTC AAGCAACTTT GTTCAACATT 720 

3TCTTTT GAATTTATGC ATGGTAGTAA ACATICTCTC 780 

TATTAACTTT ATTACCTAAG GCTAAACCTA AAATTTTTAA GCSlAAftTTAG AAAAATAGTC 840 

A GTTTGTTACA TTTAGTATTT TCCCAATAAA ATTGGTCGTT 900 
r TATTIGGAGA GTCTGTGCAA AATGTCACTA AAAATftftATT Al 



Seq ID NO: 203 D 
Nucleic Acid Accession #: NM_005656.1 
Coding sequence: 57.. 1535 



A ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

C AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTOTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAARGACTA 300 

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360 

CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACSGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

GGGCSGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780 

GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 
TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 
ATGGAGCCGG ATACCSAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 



AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 
CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 
TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 
CAGCCATGAT CTGTGCCGGC 7TCCTGCA0G GGAACGTCGA TTCTTGCCAG GOTGACAGTG 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 

ACTGGATTTA TCGACAAATG AAGGCAAACG G 

A GAAAACAATG GGGCTGGTTT TGCTTCCCCG T 

TGCCATACTG TGCAGGCTGC AGTGGCl 

CCGCAAGQGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG A 
GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTITGGA 



GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCIGGGGCCA C 
TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 
GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACGTGGT AGTCACTTGT 



GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC T 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA STTTGGCACC 234 0 

ATGTCQGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460 
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i PAQYYPSPVP QYAPRVLTQA 
j AAGLLWKFMG SKCSMSGIEC 
4 YSSQRXSWHP VCQODf^ENY 
Y KMNFYSSQGI VDDSGSTSFM KLNTSAGNVD lYKKLYHSDA CSSK-^iWSLR 
CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQtrfflV C3GSIITPEW IVTA-^HCVEK 
1^ FAGILRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 
G MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDMLI 
S NIWWLIGDTS WGSGCAKftYR PGVYGNVMVF 



20 GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG 
AGGGGCTGAG TTTGCCAGGG CCCflCTTGAC CCTGTTTCCC 
GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG 



CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCX3CCT 300 

25 CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

TTCGAAGCTG SACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTG6 GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCRACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

30 TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCKGCTC 660 

AQCGGCAGTC ACCTGTTCAC CTGTIGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

35 AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

A AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

PCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 



CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 
TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCIACCAGCC TCAGCTGGCC 
ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 
TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 
CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCPACCIGGC GA.CCCGACTC 
TGGCTACGCA ACGSGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGIGCT ACCCACTGGG 
GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC ASTGCTGGTC ACTAGAGGAG 
GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG T3GAGGACGG GGTGGCAGAC 
CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT QA.GTGCACCA 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCICTA CCGGCACCGG 
AACAGCATGA AAGTCTTCCI GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 



A GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTICACT 
GAGTCAGAGA AGAQGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 
I CTGTGGTGTG AGAGCIGACT 

TCCAGAGGAC GCTGCCCTGG C 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC C 

GGCCAGCTGG CCTGCTGCTG TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2 940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT QGTGGAACAG 3 000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3 060 

GAGAGGGRAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCICCCA 312 0 

G GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

C ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3 240 

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT QTAGTIGTTG 3 300 

CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT (3IGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 
CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT G 
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CCACCATATC CACCCTCGCI CCATCTTTGA ACTCAflACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTRATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 




RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL P3GEYQELLW CADAEKKQQC 120 

GKGRCPFDPN FKSTALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWLQDPAF 24 0 

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTSGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TMSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 4 BO 

PRVHIIEELQ IFSS3QPVQN LLLDTHRGLL YAASHSGWQ VPMAMCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LWLRNGAEVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVL LEVLFLLYHH RNSMKVFLKQ GECASVHPKT CPWLPEETR PLNGLGPPST 780 

PLDHRGYQSL SDSPPGSRVF TESEKEPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 . A method of detecting a bladder cancer-associated transcript in a cell from a patient, 
5 the method comprising contacting a biological sample from the patient with a polynucleotide 
that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 
Tables lA-13. 



2. The method of Claim 1 , wherein the biological sample comprises isolated nucleic 

10 acids. 



The method of Claim 2; 

a) wherein the nucleic acids are mRNA; or 

b) fiirther comprising the step of amplifying nucleic acids before the step of contacting 
the biological sample with the polynucleotide. 



The method of Claim 1 , wherein the polynucleotide: 

a) comprises a sequence as shown in Tables 1 A-13; or 

b) is immobilized on a solid surface. 

The method of Claim 1 , wherein the patient is: 

a) undergoing a therapeutic regimen to treat bladder cancer; or 

b) suspected of having bladder cancer. 



25 6. An isolated nucleic acid molecule consisting of a polynucleotide sequence as shown 
in Tables lA-13. 



7. The nucleic acid molecule of Claim 6, which is labeled. 



30 8. An expression vector comprising the nucleic acid of Claim 7, 
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9. A host cell comprising the expression vector of Claim 8. 

10. An isolated polypeptide which is encoded by a nucleic acid molecule having 
5 polynucleotide sequence as shown in Tables lA-13. 

11. An antibody that specifically binds a polypeptide of Claim 10. 

12. The antibody of Claim 1 1 , fiirther conjugated to an effector component. 

10 

13. The antibody of Claim 12, wherein the effector component is a fluorescent label. 

14. The antibody of Claim 12, wherein the effector component is a radioisotope or a 
cytotoxic chemical. 

15 

1 5 . The antibody of Claim 1 1 , which is 

a) an antibody fragment; or 

b) a humanized antibody 

20 16. A method of detecting a bladder cancer cell in a biological sample from a patient, the 
method comprising contacting the biological sample with an antibody of Claim 1 1. 

1 7. The method of Claim 1 6, wherein the antibody is fiirther conjugated to an effector 
component. 

25 

18. The method of Claim 1 7, wherein the effector component is a fluorescent label. 

19. A method for identifying a compoimd that modulates a bladder cancer-associated 
polypeptide, the method comprising the steps of: 
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a) contacting the compound with a bladder cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables lA-13; and 

b) determining ttie functional effect of the compound upon the polypeptide. 

5 

20. A drug screening assay comprising the steps of 

a) administering a test compound to a mammal having bladder cancer or a cell isolated 
therefrom; 

b) comparing the level of gene expression of a polynucleotide that selectively hybridizes 
10 to a sequence at least 80% identical to a sequence as shown in Tables 1 A-1 3 in a 

treated cell or mammal with the level of gene expression of the polynucleotide in a 
control cell or mammal, wherein a test compound that modulates the level of 
expression of the polynucleotide is a candidate for the treatment of bladder cancer. 
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